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We review the description of inclusive single unpolarized light hadron production using fragmen- 
tation functions in the framework of the factorization theorem. We summarize the factorization 
of quantities into perturbatively calculable quantities and these universal fragmentation functions, 
and then discuss some improvements beyond the standard fixed order approach. We discuss the 
extraction of fragmentation functions for light charged (tt*, and p/p) and neutral (Ks and 
A/ A) hadrons using these theoretical tools through global fits to experimental data from reactions 
at various colliders, in particular from accurate e^e~ reactions at LEP, and the subsequent success- 
ful predictions of other experimental data, such as data gathered at HERA, the Tevatron and RHIC 
from these fitted fragmentation functions as allowed by factorization universality. These global 
fits also impose competitive constraints on as{Mz)- Emphasis is placed on the need for accurate 
data from ep and pp{p) reactions in which the hadron species is identified in order to constrain the 
separate fragmentation functions of each quark flavour and hadron species. 
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1. INTRODUCTION 

Fragmentation functions (FFs) constitute one of the most important free inputs required for a comprehensive 
description of most collider processes to which perturbative QCD is applicable, being a necessary ingredient in 
any sufficiently complete calculation of processes involving detected hadrons in the final state. They quantify the 
hadronization of quarks and gluons which must eventually occur in every process in which hadrons are produced. While 
parton distribution functions (PDFs), another of the important free inputs, are relevant for collisions involving at least 
one hadron, most importantly at present for pp collisions at the LHC, FFs are relevant in principle for all collisions, 
even those without any initial state hadrons such as electron-positron collisions at the future ILC. Furthermore, while 
knowledge and application of PDFs is limited to the types of hadrons that can be practically used in the initial 
state, being almost exclusively nucleons, FFs can be constrained by, and/or used for the predictions of, measurements 
of the inclusive productions of neutral and charged hadron species ranging from the almost massless to the very 
heavy. Such a large range of processes provides a large range of information on hadronization, and hence provides 
an important contribution to our understanding of non perturbative physics in general, and allows for a particularly 
incontrovertible phenomenological determination of the applicability and limitations of the various approximations 
used in the context of QCD factorization. Such data are also sufficiently accurate to allow for competitive extractions 
of as{Mz), the strong coupling constant of QCD evolved for convenience to the Z-pole mass Mz and the remaining 
of the most important free inputs, which improves the accuracy of perturbative QCD calculations in general and 
imposes constraints on new physics. We note here that while other inputs such as higher twist, multi-hadron FFs, 
fracture functions, and so on become important in certain kinematic limits, FFs are always necessary for a complete 
description of inclusive hadron production. 

In this article, we review the progress in understanding the hadronization of partons, embodied in FFs, and their 
application to inclusive hadron production. The concept of fragmentation was first introduced by Feynman and 
Field in 1977 [l| to explain the limited transverse momenta and energy fraction scaling of hadrons in jets produced 
in e'^e^ collisions, as well as the presence of a few high transverse momentum (px ^ 2) GeV hadrons in hadron- 
hadron collisions. In the latter case, the collision of a pointlike constituent particle of one hadron with that of 
another produces a pair of particles whose directions of motion are opposite to one another but at any angle to the 
parent particles, including large angles. These elementary particles must eventually hadronize to produce the high 
Pt hadrons. Intuitively, any inclusive single hadron production processes may be predicted by first calculating the 
equivalent partonic process (i.e. replacing the produced hadron by a parton and the inclusive sum over hadron final 
states by partonic ones), then allowing the produced parton to hadronize. This parton model is made more precise by 
the factorization theorem. 

Our discussion will be limited to inclusive single unpolarized hadron production, being better understood than its 
polarized counterpart. Final state partons will on average hadronize to hadrons of all species that are not kinematically 
forbidden by the particular reaction, but, of all the charged hadrons, partons will mostly hadronize to the 3 lightest 
ones, being tt^, and p/p, so their FFs are the most phenomenologically well constrained. (For a concise summary 
of the properties of some known mesons and baryons, see Ref. [2j.) The species of neutral meson and neutral baryon 
most likely to be produced in the hadronization process are 7r°, and n/n respectively, but their FFs are accurately 
approximated by the FFs for tt^ and p/pdue to SU(2) (nuclear) isospin symmetry between the u and d quark fiavours. 
The next lightest species of meson and baryon is Kg and A/ A respectively, whose FFs have therefore also been greatly 
studied. {K1 is not usually observed in experiment because it takes a long time to decay into charged particles.) 
Together with 77, /o, p^, p, ui, K*^, 77', gq and (j) mesons, whose FFs are unfortunately either unknown or rather 
poorly known at present, the particles mentioned in this paragraph complete the list of known hadrons which have a 
mass less than or equal to the A/ A mass and which are the most copiously produced in hadron production, and thus 
measurements of their production lead to a rather comprehensive picture of the hadronic final state. In this review 
we shall focus on the productions of these particles only, with the exceptions of the productions of those mesons just 
listed. 

Much of the techniques for global fitting of PDFs can be carried over to global fitting of FFs, for example the 
treatment of systematic errors on the experimental data and the propagation of experimental errors to the FFs, as 
well as the techniques used for the minimization of chi-squared. On the theoretical side, much of the formalism of 
both the fixed order (FO) calculations and its improvement via e.g. large x resummation and incorporation of heavy 
quarks and their masses, is also very similar. 

To date, the subject of fragmentation has been important in various areas of phenomenology and we give an 
incomplete list of examples: Phenomenological constraints on the nucleon PDFs for polarized quarks and antiquarks 
separately, which are important for the determination of the source of nucleon spin, can be obtained from measurements 
of semi-inclusive DIS with polarized initial state particles for the inclusive production of single unpolarized light 
charged hadrons using knowledge of the FFs for these particles Valence quark PDFs can be similarly extracted, 
but from data for which the initial state particles are unpolarized In both cases, the differences between FFs for 
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positively and negatively charged particles, are required, which are relatively poorly constrained by data from e.g. pp 
reactions, and not by accurate e+e~ reaction data. Polarized pp{p) reaction data from e.g. RHIC can also impose 
constraints on the nucleon PDF for the polarized gluon and measurements of photoproduction in polarized ep 
reaction at e.g. the proposed cRHIC [7] or LHeC colliders can impose constraints on the photon PDF for polarized 
partons [9||- FFs provide a consistency check on transverse momentum dependent FFs, which replace FFs when the 
transverse momentum of the produced hadron is measured in addition to the longitudinal, since formally the FFs 
are reproduced by integrating them over the transverse momentum. The suppression of tt^ production in heavy 
ion collisions (Au Au) relative to pp collisions [l^ measured by the PHENIX collaboration at RHIC, and to be 
further investigated by the ALICE, ATLAS and CMS collaborations at the LHC, provides information on the much 
anticipated scenario of a quark-gluon plasma (QGP) that filled the universe the first ten millionths of a second after 
the big bang and created the primordial matter. Such studies may therefore contribute to our understandin g of 
both cosmology and the physics of non perturbative QCD. Photons produced by partonic fragmentation [Til. Il2j| 



contribute significantly to the photonic background of various direct photon signals in DIS, such as that of Higgs 
boson production from hadrons. Fragmentation of a quark to a lepton pair or, equivalently, to a virtual photon 
is perturbatively calculable if the invariant mass is much greater than Aqcd, in contrast to FFs for hadrons and 
the photon, so measurement of the polarization of the virtual photon that decays to the lepton pair of the Drell-Yan 
cross section can be used to test models of the formation of J/ip particles, whose production from fragmentation is 
otherwise similar to that of the virtual photon. In Ref. fis'l it has been proposed to identify exotic hadrons such as 
tetraquarks by identifying properties of FFs which are typical for quarks whose flavour is favoured, i.e. that are of 
the same flavour as any of the produced hadron's valence quarks. Some studies of the fragmentation of squarks and 
gluinos into hadrons have been performed in Ref. and much more recently into supersymmetric hadrons in Ref. 
|17| . which may be relevant at the LHC. The supersymmetric extension of the DGLAP evolution of FFs for light 
charged hadrons has been studied in Ref. [Tsj in the context of ultra high energy cosmic rays. FFs for light charged 
hadrons are required for the calculation of hadronic signatures of black hole production at the LHC [19j| . 

The rest of this review is structured as follows. The basic results of the QCD factorization theorem, which forms 
the starting point of all calculations of inclusive hadron production, are given in section[2l in particular the separation 
from the overall cross section of the process dependent parts, which can be calculated using the FO approach to 
perturbation theory. (The derivation of these results is outlined in appendix Then in section [3] we discuss the 
extraction of FFs from accurate e'^e~ data and from various well motivated non perturbative assumptions. The 
ability of calculations using these fitted FFs to reproduce measurements is studied in section 31 Then we turn to more 
recent progress: Improvements to the standard FO approach that have not been incorporated into nearly all global 
fits are given in section[5l namely hadron mass effects and large x resummation. The treatment of experimental errors 
and their propagation to fitted quantities, which has been rather comprehensively implemented in PDF fits but to a 
somewhat lesser degree in FF, is discussed in section [6l The 3 most recent global fits, in which the implementation 
of these improvements can be found, are discussed and compared in section [71 Finally, in section [HI we examine the 
improvement of the standard FO approach at small x by resummation of soft gluon logarithms, which has so far been 
successfully tested at LO, and we consider what is needed for a full treatment of soft gluon logarithms to NLO. In 
section [HI we predict the future experimental and theoretical progress of FF extraction, and give a summary of the 
current progress in section [TUl The LO splitting functions are given in appendix IB] for reference, the relevant Mellin 
space formulae in appendix [Cl and a summary of all data that can be reasonably reliably calculated and which can 
be used to provide constraints on FFs in appendix [D] Some of these issues have also been discussed very recently in 
Ref. 20], with an aim towards the modification of fragmentation in QCD media. 



Intuitively, the detected hadron h of inclusive single hadron production events is produced in the jet formed by a 
parton a produced at a short distance \/Mf, and carries away a fraction z of the momentum of a. This "probing 
scale" Mf should be of order of the energy scale of the process, whose precise choice is somewhat conventional. The 
probability density in z for this to take place is given by the FF D'^{z, M'j). The hadronic cross section is then equal to 
the cross section with h replaced by a, hereafter referred to as the partonic cross section, weighted with the FF of a and 
summed and integrated over all degrees of freedom such as a and z, i.e. eq. (|T|) below. According to the factorization 
theorem, which follows from QCD in a model independent way, the leading twist component of any inclusive single 
hadron production cross section takes this intuitive form, where the FFs are process independent or universal among 
different initial states. The factorization theorem also asserts that all processes in the partonic cross section that 
have energy scale below the factorization scale Mf factor out of the partonic cross section and are accounted for by 
the FFs, and that the resulting factorization scale dependence of these FFs may be calculated perturbatively. In this 
section, we highlight the results of the factorization theorem. An outline of the formal derivation of the factorization 




2. RESULTS OF THE QCD FACTORIZATION THEOREM 



6 



theorem is given in appendix \X\ for the interested reader. 

1. Factorized cross sections 

In general, as weh as Es, an inclusive single hadron production process depends on the fraction x of the available 
momentum or energy carried away by the detected hadron h. For example, in e'^e~ —^"f*—^h + X, whose kinematics 
are specified in Fig. [1] (left), x = 2ph/y/s and Eg = y/s. We will assume for now that it is reasonable to neglect the 
effect of the hadron mass ruh, which is of 0(rn^/pfj when ^ Ph, where ph is the momentum of the detected 
hadron. When the effect of hadron mass, to be discussed in section [5TT1 is taken into account in the calculations, the 
scaling variable x of the factorization theorem and the fractions Xp and xe of the available momentum and energy 
respectively of the process taken away by the detected hadron must be distinguished from one another. Note that the 
cross section may depend on other variables in addition to x and Eg, but we will not indicate this explicitly unless 
necessary. The factorization theorem asserts that the cross section takes the form of a convolution 

d<y\x,El)= J2 / dzda^ i-,j^,^,as{Mf) \ Dl{z,Mf), (1) 

i=-n;^ \ f ^ / 

up to higher twist terms, which are suppressed relative to the overall cross section by a factor 0(Aqcd/£'s) or more. 
The parton label i = for the gluon, while i — {—)! for (anti)quarks, where I = 1, ... ,6 corresponds respectively to 
the flavours d, u, s, c, b and t. The fact that n/ ^ 6 necessarily will be explained below, mi is a renormalization 
scheme-dependent mass associated with parton i, which will be taken to be its pole mass, and = as/(27r) is the 
expansion parameter in perturbative series. The da* are the equivalent partonic cross sections obtained by replacing 
the detected hadron h with a real on-shell parton i moving in the same direction but with momentum pt/ z, and the 
sum over unobserved hadrons replaced with a sum over unobserved partons. In e.g. e^e~ — *■ /i + X, da^ is completely 
calculable in perturbation theory, while for processes involving initial state hadrons, such as ep ^ e + h + X at 
HERA and pp{p) ^ h + X a.t the LHC (RHIC), dcr' will be convolutions of pcrturbatively calculable quantities with 
PDFs for each initial state hadron, a result which also follows from the factorization theorem. The dcr* are otherwise 
pcrturbatively calculable if all subprocesses with energy scale below some arbitrary factorization scale Mf ^ Aqcd 
are factored out of them and into the FFs (and PDFs if applicable) according to the factorization theorem. While 
the da'' differ from process to process, the FFs are universal and therefore, through them, measurements of one process 
impose constraints on others in which the same hadron is produced. 




FIG. 1: Amplitude for the process e'^e ^* qj -\- ^ h + X (left). Time flows from left to right. The virtual photon 
momentum q obeys q^ = s, where is the center-of-mass energy. The blob contains all QCD processes consistent with 
its external propagators or legs. Also shown is a schematic illustration of the leading twist component of this process after 
factorization (right), involving an intermediate real parton i that fragments to the detected hadron h. The top right blob and 
its external lines represents the FF _D^, while the rest of the diagram is the equivalent partonic cross section, i.e. with h replaced 
by i. 

In more detail, the perturbative expansion of the unfactorized da^ in the limit Eg 3> mi contains potential mass 
singularities, being logarithms of the form \n{mi/Es) raised to powers of integers and, because the largest power in 
a given order in Os grows with the order, they may spoil the convergence of the series even when mi ^ 0. These 
potential mass singularities, which arise from energy processes much below Eg, may be factored out of the dcr*. There 
is clearly some freedom in choosing whether to place each process of energy scale around Mf in dcr* or _D^, and this 
choice defines the factorization scheme. (This is a physical definition — in practice the scheme is fixed by the choice 
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of subtraction terms in the factorization.) Although the leading twist component of da^ is formally independent of 
the choices of factorization scale and scheme, its perturbative approximation discussed above will depend on these 
choices, which must therefore reflect the physics of the overall process. The theoretical error in this approximation can 
also be estimated by varying these choices. The potential mass singularities from (anti)quarks with mass mj ^ 
are dampened by factors of Es/mj, i.e. they approximately decouple [21] and so must not be factorized to avoid 
introducing large uncanceled counterterms in da*. Therefore, the active partons, being those partons whose potential 
mass singularities are factored out of the dai and into the FFs, and to which the summation over i in eq. ([1]) is 
restricted so that n/ labels the number of species of active quarks, should be limited to include only those partons 
whose masses rrii < Eg. This will always include the gluon and, since perturbative QCD is only valid in the region 
Es S> Aqcd, also the light quarks, defined to be those quarks whose masses are of O(Aqcd) or less, i.e. the 3 lightest 
quarks d, u and s or / = 1,2, 3. The intrinsic light hadron PDF of a quark is expected to be of 0{mh/mj) or less 
[22| . but the same property may not necessarily hold for FFs. Therefore, it may also be necessary to always include 
some of the heavy quarks, defined to be those quark whose mass mj ^ Aqcd, i-e. c, b and i or / = 4,5,6, in the 
list of active partons, in which case the cross section will not be perturbatively calculable if mj ^ Eg. For example, 
it could happen that intrinsic charm quark fragmentation is deemed important in a cross section, but it cannot be 
incorporated into the cross section's calculation if the only appropriate scheme is the 3 flavour one. This problem 
would be avoided if a method was known for correctly incorporating the intrinsic FF of quark J in a cross section 
when this quark is not active. This issue will be discussed further at the end of subsection 12 .31 in the light of matching 
conditions between quantities deflned with different numbers of active partons. 

Note that there is another type of potential mass singularity appearing in the cross section, which behaves like a 
power of \n{Es/mj) for any heavy quark mass mj ^ Eg. These can be absorbed into the strong coupling constant 

by using a renormalization scheme for which quark J is not active. 

The rifth scheme is a renormalization and factorization scheme for which the number of active quark flavours is 
rif. Results are usually presented in the n/th Collins- Wilczek-Zee (CWZ) scheme 23|, also known as the decoupling 
scheme, which reduces to the MS scheme for only n/ massless quark flavours in the limit that the rif active quark 
masses vanish and the remaining quarks' masses become infinite so that they completely decouple from the theory. 
The unsubtracted partonic cross sections with heavy quarks and their masses included and their subtraction terms in 
the CWZ schemes have been calculated for e+e^ and ep reactions in Refs. [U, [2^ and [l^ respectively. The inclusion 
of heavy quarks and their masses in the partonic cross sections for pp{p) reactions has been calculated in Ref. [23] , 
and their subtraction terms in Ref. [28i]. 



2. DGLAP evolution 



Roughly speaking, factorization replaces each logarithm \n{mi/Es) in the partonic cross section with \n{Mf/Es). 
These artefacts of factorization may spoil the accuracy of the perturbation series in the same way that the potential 
mass singularities did unless we ensure Mf — 0[Eg), in which case the Mf dependence of the D'l{z,Mj) must 
also be known. Fortunately, unlike for the z dependence, this Mf dependence is perturbatively calculable, provided 
Mf ^ Aqcd: Writing the M| dependence of the FFs in the form of the Dokshitzer-Gribov-Lipatov-Altarelli-Parisi 
(DGLAP) equation 

with i — —nf,...,nf, then the splitting functions Pij{z,as) are each perturbatively calculable as a series in a^. 
Specifically, denoting the square matrix with components by P, their expansion in takes the form 

oo 

P(z,a.) = 5]p("-i)(z)a:, (3) 

n=l 

where the p("~i)(z) are non-singular even in the limits for which any active parton mass vanishes. In the n/th CWZ 
scheme, the P^ are independent of all parton masses, and thus can be obtained by taking the limit discussed at the 
end of subsection 12.11 and performing factorization in the MS scheme. Equations ([2]) and ([3]) serve similar purposes 
to the evolution of as , 



^^^^as(/i^) = /3(os(/i^)), 



(4) 
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and the perturbative expansion of the /3 function, 

oo 

/3(a.) = -E^«-2<, (5) 

n=2 

respectively, which are used to resum powers of the logarithm In /i in perturbatively calculated quantities, where /i is 
the renormalization scale. In other words, the ultimate purpose of the DGLAP equation is to resum powers of the 
logarithm In M f for all M f ^ Aqcd in the partonic cross section. The physical interpretation of eq. ^ is most easily 
made from its solution, which takes the form 

i?f(^,M|)= I —E^, {^,a^{M]),a.{M^)) D'j{z\M^). (6) 

j=~nf 2 

Each quantity Eij{z,as{M'j),as{MQ)), which also obeys eq. ([2|) on taking Etk, may be interpreted as the FF 

of parton i at resolution scale Mf into parton j at resolution scale Mq carrying a fraction z of parton I's momentum. 
It is subject to the boundary condition Eij{z,as,as) = SijS{l — z) and depends only on the P^"~^\z) of eq. ([3]), 
although its analytic dependence on z may not be calculated. We will see in subsection 12.51 that an analytic form for 
the perturbative calculation of E may be obtained in Mellin space. 



3. Changing the number of active partons 



So far, everything we have discussed applies for a given assignment of partons as active. Using always the same FFs 
at some initial input value Mf — Mo of the factorization scale, we would like to be able to calculate cross sections at 
any energy Eg (in this review, Mj, Mq and E^ are much greater than Aqcd unless otherwise stated). However, to 
ensure that the perturbative calculation of dcr* gives a reliable approximation, all partons of mass rrii <JC Mf = 0{Es) 
must be assigned as active, even if they are not active at Mf = Mq. Consequently, this assignment must be allowed 
to vary and the relationship between quantities defined in a scheme in which the lightest Uf quarks are active (and 
of course the gluon, which must always be active) must be related to similar quantities for which n/ + 1 quarks are 
active. This is trivial for the factorized partonic cross sections dcr*, and the necessary matching conditions for Og are 
known [3l[. The matching conditions between the FFs of the Ufth scheme, I?^ with i = —Uf, . . . ,nf, and the FFs of 
the 71/ + 1th scheme, with i — —{rif + 1), . . . , n/ + 1, take the form 

Dr{z,M}) - f: f ^,aAM])\ D]{z\M)) (7) 

j=-nf ^ \ f / 

where the matrix A is perturbatively calculable for Mf = 0(m„^+i), where this matching should therefore be 
done. Otherwise, the precise choice of this matching threshold is arbitrary because it is non physical, and should be 
distinguished from non arbitrary physical thresholds such as that for the production of a charm quark. In the CWZ 
scheme, A depends on rrinj+i but not the other mu and is now known to NLO (3^ . The only non zero components at 
NLO are Agg, which vanishes if the matching is done at Mf — mnj+i, and Anj+i^g. The latter quantity is not needed 
if the whole FF of the n/ + 1th flavour quark, both intrinsic and extrinsic, is treated as one single function to be 
fitted to data, which is usually the case. This is in contrast to the spacelike case: because intrinsic heavy quark PDFs 
for the proton can normally be neglected, the quantity An^+i^g is necessary for obtaining the PDF of the rif + 1th 
flavour quark, being almost completely extrinsic. 

According to eq. ((T]), a heavy quark FF with a negligible intrinsic component (i.e. its FF vanishes when it is not 
active) is fully determined from the other FFs. Therefore, if the intrinsic components of the heavy quarks' FFs are 
negligible, their FFs are perturbatively completely determined by only the gluon and the light quarks in the nf — 3 
scheme: Using eq. ([7]) to convert to = 4 scheme gives the charm quark FF in terms of the gluon and light quark 
FFs, i.e. charm quark fragmentation proceeds via perturbative fragmentation to a gluon or light quark, which then 
fragments non perturbatively to the detected hadron. Similarly the bottom quark FF for Uf = 5 active quark flavours 
is fully determined from the gluon, light and charm quark FFs. 

On the other hand, it may not be a good approximation to neglect the intrinsic heavy quark FFs even for light 
hadrons, and the intrinsic charm FF is certainly not negligible for charmed hadrons such as D mesons. In many 
calculations for light hadron production, the charm quark FF D^'{z,M'j) is treated as an unknown function at the 
matching scale where Mf = 0(7714), just as the gluon and light quark flavour FFs at the initial scale Mf = Mq are, 
and the gluon and all 4 quark FFs are evolved from there according to the DGLAP equation of the rif = A scheme. 
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A similar procedure is performed for the bottom quark FF. While this incorporation of intrinsic charm in the = 4 
scheme is consistent with the factorization scheme, the conventional approach of setting the complete charm quark 
FF to zero in the cross section calculated in the Uf — 3 scheme is not, because intrinsic charm quark fragmentation 
effects should not depend on the choice of scheme (although the precise definition of the intrinsic charm quark FF 
itself is scheme dependent). Consequently, this approach contains an inconsistency: When the energy, and therefore 
the factorization, scale is close to the charm quark mass so that both the 3 and 4 flavour number schemes should be 
valid, they will lead to very different cross sections in the case that the intrinsic charm quark FF is large. In practice, 
this inconsistency of the 3 flavour scheme, and 4 flavour scheme when one also considers a possibly important bottom 
quark FF, does not matter since most cross sections of interest are of a sufficiently high energy scale that the 5 flavour 
scheme should suffice for all calculations. 



4. Various treatments of quarks with non negligible mass 

In the above discussion, we have assumed that all quarks have non negligible but finite masses. In practice, the 
energy scale is usually not close to any quark mass, so all perturbative results are usually approximated by the zero 
mass variable flavour number "scheme" (ZM-VFNS), where the masses nij of all quarks above Mf = 0{Es) are set 
to infinity so that these quarks decouple from the theory, which introduces a relative error of 0{El /ra^) to the cross 
section, while all quarks with masses below are treated as active and their masses are set to zero, which introduces 
a relative error of 0{mf / E'^). Such an approach therefore fails when the energy scale is of the order of a quark 
mass, but is otherwise a reasonable approximation. However, the procedure we have been discussing in the previous 
subsections is more general and is called the general mass variable flavour number scheme (GM-VFNS), where both 
quarks and their masses are treated in a similar fashion to their treatment in the Aivazis-Collins-Olness-Tung (ACOT) 
scheme [33l [3^ of spacelike factorization. Such a procedure has for example been explicitly implemented for h eavy 
flavour hadrons in photo- [IHl and hadroproduction [2^ in pp reactions, and more recently for e~^e~ reactions [25^ . 
The usual simplifications and improvements that have been made to the ACOT scheme since its inception may also 
be applied to fragmentation, at least in principle. For example, assuming that the intrinsic fragmentation of heavy 
quarks is negligible, the power suppressed terms in the NLO factorized cross section for the inclusive production 
of this heavy quark appearing in the overall hadronic cross section are in fact arbitrary, since the only purpose of 
the former cross section in this case is to complete the cancellation of all potential mass singularities in the ovejjill 
hadronic cross section such that there are no singularities in the massless limit [s^l- In the S-ACOT scheme |3^, 
these power suppressed terms are set to zero for simplicity, while in the ACOT(x) scheme [37|, the accuracy of the 
perturbative calculation relative to the original ACOT scheme is improved: The heavy quark production cross section 
is again chosen to be that for the production of a massless quark as in the S-ACOT scheme, but its LO part is also 
multiplied hy a, z dependent step function which is equal to that multiplying the gluon production cross section's 
potential mass singularity, to ensure it and its counterterm only ever appear at the same time. 

5. Analytic Mellin space solution of the DGLAP equation 

The integrations over z on the right hand sides of eqs. ([T]), ^ and ([6]) are known as convolutions, and are a typical 
feature of results of the factorization theorem. The Mellin transform, discussed in appendix[Cl which does not destroy 
any information because this transform is invertible using the inverse Mellin transform of eq. (jC5[) . converts these 
convolutions into simple products, and therefore analytic work is usually performed in Mellin space. For example, eq. 
([2]) becomes 

-^A_D{N, Mf) = P{N, as{M]))D{N, M]), (8) 

where for further simplification we are now also omitting parton labels i, j and the more trivial hadron label h, it 
being understood in eq. ([5]) that the matrix P acts on the column vector D according to the usual matrix product 
definition. The elements of P(N, as(M|)) for integer N are equal to the anomalous dimensions of the twist two, spin 
N gauge invariant operators in D, which are needed in the operator product expansion. In the solution to eq. ([5]), 



D{N,M]) = E{N,a,{M]),a,{Ml))D{N,Ml), 



(9) 



10 



which is the Melhn transform of eq. ([6]), the elements of E may be expressed analytically in terms of N, as{M'j) and 
as(Mg) up to the same accuracy as P [sl, [s^: Define the series 



C/(iV,a,) = l + ^C/(")(iV)a^ (10) 

such that 



E{N,as,ao) = U{N,as)Ei^o{N,a„ao)U~\N,ao), (11) 

where 



EwiN,as,ao) = exp 



a 

Po ao 



(12) 



is the LO result for E, being an exact solution to eq. ([5]) with P — a^P'-^K The purpose of U^^{N, oq) in eq. pT|) is to 
ensure the boundary condition E{N,as,as) = 1. To all orders, eq. ([5]) can be converted via eq. pTjl to an evolution 
equation for U: 



dU R 1 



das f3o Potts 



U,P^°^ 



(13) 



where R = J2n=i = -Po{P/P + P^°V {l3oas))U . The exponentiation in eq. ([T^ and the commutator in eq. 

([13]) are handled by choosing a specific basis for the FFs in a factorization scheme in which the symmetries of QCD 
are obeyed. Such a basis will be given in subsection 12.61 

As a final remark, the Mellin space formalism makes clear the importance of the DGLAP equation in the application 
of perturbation theory to cross section calculations: Any set of functions Di of the variables < z < 1 and < M'j < oo 
obeys the form of the DGLAP equation, eq. Q, by choosing P{N,as{M])) = {dD{N,Mj)/d\nMj)D-'^{N,Mj), 
which follows from its Mellin transform, eq. ([H]). (Here, D can be regarded as a matrix whose columns consist of the 
Di for a given hadron species which varies from column to column, i.e. there is some freedom in the definition of P.) 
The importance of the formalism behind the DGLAP equation is that P is constrained in QCD to depend on purely 
partonic graphs, and furthermore to be perturbatively calculable (these results follow from eq. (|A29P ). and therefore 
the M f dependence of the FFs is completely and calculably constrained. In particular, when expanded in it takes 
the form in eq. Q with non singular coefficients. However, eq. ([2]) is not the only way to evolve the FFs in M/. 
Alternatives to eq. may be preferable for certain physical reasons, such as the double logarithmic approximation 
(DLA) to be discussed in subsection 18.31 which is more apt for the small x region, or the evolution proposed in 
Ref. 4(|, whose equivalent splitting functions may exhibit certain physical properties not seen in the usual DGLAP 
splitting functions beyond LO, such as the Gribov-Lipatov relations and the expected large x behaviour beyond 
leading order due to purely multi-parton quantum fiuctuations when the physical strong coupling constant is used 
as the expansion parameter. Such equations are similar to the DGLAP equation but with both occurrences of My 
in eq. ([2]) multiplied by (different) powers of z to ensure that the inverse of this modified scale truly represents the 
fluctuation lifetimes of successive virtual parton states pertinent to the kinematic region being studied. However, if 
required, it is always possible to recast such alternative evolution equations back into the form of eq. ^ , and thereby 
use them to obtain an alternative expansion for P in the kinematic region of interest to that in eq. ([3]) . To put this 
in an alternative, but equivalent, way, eq. is not the only possibility for approximating P: In general, P in certain 
limits of N and Os may be better approximated in the form P{N,as) = '^^X''^R^^^\Y), where X and Y are each 
in general a suitably chosen function of both N and as- We will see an example of such an alternative expansion in 
subsection 18.21 namely eq. (|147p (where uj = N — 1). 



6. Symmetries 

Although FFs are not physical, the factorization scheme should be chosen such that they respect the symmetries of 
QCD in order to keep results as simple as possible, and we will assume that this has been done. Indeed, the commonly 
used MS scheme is such a scheme. Consider first the charge conjugation symmetry of QCD. Writing the cross section 
or FF for the production of /i+ (or h^) as \ where O — Di or da respectively, and then defining charge-sign 

unidentified and charge-sign asymmetry quantities as O'* = O'*^ -I- O'^ and O'^"'^ = O''^ — O'^ respectively (or 
more generally as O'*/'' = O'* -|- O'* and O^c'*/'* = O'^ + which includes also neutral hadrons), this symmetry 



11 



implies that each of these two combinations of cross sections only depends on FFs that have been combined in the 
same way (or they vanish). The calculation of such cross sections is therefore simpler than that of the production 
of hadrons of a given charge-sign, which depends on both charge-sign unidentified and charge-sign asymmetry FFs. 
Lorentz invariance implies a similar feature for the polarization of the hadron and of the partons, assuming that there 
are only 2 possibilities for the hadron's polarization. 

We now study the simplifications to the evolution of FFs when the charge conjugation symmetry is taken into 
account. Letting D'^^ — Dj {D^^ = D^j) denote the FF for the hadron h of the (anti)quark of flavour / = 1, . . . , n/, 
this symmetry is accounted for by the results that follow from charge conjugation symmetry, 

j^h-^(orh-) ^j^h-{o.h+)^ (14) 

Equation (fTi|) implies that charge-sign unidentified FFs D'^^^^^ = D^^j-™ ^ (the separate brackets "[]" and "()" 

imply that the changes qi — + qi and — > h~ respectively may be made, and may be made independently of one 
another) where, omitting the superscript "/i+(or h~y for brevity from now on, the sums 

D^^/^^=D,,+D^,. (15) 

Due to charge conjugation symmetry, these FFs and the gluon FF mix only with each other on evolution, but not 
with the valence quark FFs defined in eq. (|19p below. Furthermore, in a scheme for which P is explicitly independent 
of quark masses, such as the CWZ scheme, the SU(n/) symmetry for n/ active quark fiavours of equal mass that 
follows from QCD implies that the gluon FF will mix via eq. ([2|) with the quark FFs combined into one singlet quark 
FF 



n f 

^^ = ;7:E^w?.' (16) 
f 1=1 



In other words, eq. ^ is obeyed with D = {D^, Dg)'^ and 

This SU(nj) symmetry also implies that every non singlet quark FF, being any linear combination of the Dg^/gj that 
vanishes when they are all equal, will only mix with itself on evolution, i.e. it obeys eq. ^ but with P reduced to 
the single quantity Pns which is the same for all non singlets. The non singlets can be chosen such that they and the 
singlet form a linearly independent set of FFs, so that after evolution the FFs of quarks of each fiavour, or any 
other alternative basis of FFs, can be extracted by taking appropriate linear sums. A common choice of the set of 
non singlet FFs is 

Dqj,m = Dgj/qj - D^. (18) 

Equation IM]) also implies that charge-sign asymmetry FFs D^^"^^^ — [— ](— )£'^ '(u/Qi ^ ''^here, again omitting 
the superscript "ft,+ (or /i^)", the differences 

DA^q./gr^Dg^'Dg,, (19) 

which we refer to as the valence quark FFs. Although valence quark FFs are the same as charge-sign asymmetry FFs, 
we distinguish between them depending on the context — when working with the symmetries of quarks as we do in 
this subsection, we shall refer to them as valence quark FFs. Due to charge conjugation symmetry, valence quark 
FFs mix with each other on evolution but not with the summed FFs defined in eq. ([TS]) above. As for charge-sign 
unidentified FFs, this mixing is further simplified by the SU(rij) symmetry introduced above: The singlet valence 
quark FF 



n f 



^ 1=1 



and the non singlet valence quark FFs, which, similarly to the definition of non singlet quark FFs in eq. ()18p . we take 
as 



(21) 
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will obey eq. ([2]) but with P reduced to the single quantity Pa^s and Pa^ns respectively. As for Pns, ^a^ns is the 
same quantity for all non singlet valence quark FFs. At NLO, -PacS = ^'AcNSi which leads to the simpler evolution in 
which each valence quark FF only mixes with itself: Dj\^qi/qj obeys eq. ([2]) with P = Pa^s (or PacNs)- 

In the non singlet and valence quark sector, -Blo can be calculated in the form given in eq. (|12p and the com- 
mutator in cq. (|13p vanishes. In the singlet quark and gluon sector, eq. (1121) and this commutator are both han- 
dled by the "diagonalization" [H, of the 2 x 2 matrix p(o) = A+M+ + A-Af", where A± = (Pg',^ + Pj^g^ ± 



[P^l^ ~ P^^g'Y - 4Pg^g^pj°^)/2 arc the eigenvalues of p(o) and M± = (p(o) - Azpl)/(A± - Azp) are projection opera- 
tors, i.e. they obey Y.i = 1 where i ±, M^M^ = and M^M^ = M^. Then eq. ^ reduces to the calculable 
form Ei^o{N,as,ao) = M'iN){as/aQ)^'-^''-^^/'^°\ and eq. ([TS]) is equivalent to the result 

[/(») ^ ^ i M'R^"'> M\ (22) 

where the C/*^"^ are defined in eq. pO)) . and the P^"-' immediately after eq. p^ . Note that the right hand side of eq. 

only depends on the C/(™) for m — 1, . . . ,n — 1, so U is constructed order by order. In general, the zeroes in 
the numerator of eq. (|^^ lead to singularities in E for complex values of A^. In the calculation of the cross section 
via the inverse Mellin transform defined by eq. (jCSp . it is not necessary that the contour C should lie to the right 
of these singularities. Although this arbitrariness due to the dependence on the choice of C is of higher order than 
the order of the calculation, for numerical purposes it is preferable to eliminate the singularities, by choice of the 
spurious higher order terms. For example, at NLO, E is free of such singularities when eq. (fTTI) is calculated in 
the form E — Ei^o + U^^^ Ei^o^s — Ej^oU'^^'^aQ, which we note obeys the desired boundary condition E{N,a,a) = 1. 
Alternatively, the singularities can be made to cancel simply by choosing U^^{N,aa) in eq. pT|) to be exactly equal 
to the inverse of U{N, oq) [41|, instead of expanding it in qq. 



7. Sum rules 



Intuitively, FFs as probability densities will be constrained by conservation laws. For example, from momentum 
conservation, the momentum of a parton i must equal the total momentum of all hadrons to which it fragments, 
giving the momentum sum rule 

I dz zDl{z,Mj)^l (23) 
h -^0 

for every parton i. Similarly, charge conservation implies the charge sum rule 

V / dz ehD'l{z,M^f)^e„ (24) 
h -^0 

where e^i^ is the electric charge of hadron h (parton i). In fact, whether this probability density interpretation 
is correct, eqs. (l23t and (124]) hold in the MS scheme [l^: They are true for the bare FFs Dg^ of appendix [XI 

for which the probability density interpretation is valid, which is seen by applying the operation '^/^ Jq dz{z)x to 
the matrix element definition of the bare quark FFs in eq. (|A12I) (and the similar definition for the bare gluon 
FF), and identifying the number operator for hadrons of species h in an infinitesimal region of momentum space, 
dNh = {2TTf-'^{dP+ /2P+)d'^-'^PTal{P)ah{P). Then eqs. ^ and ^ follow for factorized FFs at all values of M/ 
in schemes for which no subtraction is made on bare quantities that are free of divergences, such as the MS scheme. 
Such intuitive but theoretically solid results may be regarded as "physical" . 

In practice, eqs. ((23|) and (f24|) are not directly used to impose a precise constraint between FFs. Firstly, the 
summation here is over all hadron species ft., so that they impose no constraint in global fits of FFs in the case 
of a specific hadron. Secondly, FFs at low z are poorly constrained because soft gluon logarithms render the FO 
approximation of the splitting functions inaccurate at small z. This is in contrast to the momentum sum rule for 
PDFs, which imposes a constraint between them for any given hadron because the summation is over parton species 
i instead of over hadron species, and PDFs are better understood at small momentum fraction. Instead, eq. (^5]) may 
serve as an upper bound on FFs and therefore as a check on phenomenological extractions of them because, when 
the sum over h is limited to just a few hadrons and the lower bound for the integral over z is replaced by a value 
sufficiently greater than zero, it will be less than one if all FFs are positive. However, this positivity condition only 
follows from the probabilistic interpretation for FFs, which is not quite correct. 
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Equations (|23p and ([24)1 do impose constraints on partonic cross sections, examples of which will be seen in subsection 
12.91 which give a useful check on their perturbative calculations. 



8. Properties of splitting functions 

The LO coefficients of the splitting functions (the n = 1 coefficients P'"^ in eq. ([3])) of the quark singlet and gluon 
sector are given in appendix |S] Because of the Gribov-Lipatov relations [2^ , they are equal to the spacelike ones 

after the interchange P^^ ^ P^^ is made. The LO coefficients of the splitting functions Pa^nSi Pa^j:, Pt.y. and Pns 
are equal to one another. The NLO coefficients (the n = 2 coefficients P^^^ in eq. ([3])) have been calculated in Ref. 
p3 | (see also Refs. [ll|, HJ] for the correction to a misprint therein). Although the simple Gribov-Lipatov relation 
does not hold beyond LO, the timelike and spacelike splitting functions are related by analytic continuation of the 
form factor from the spacelike to the timelike case [43!, |45l |46|. Of the NNLO coefficients P^^), the non singlet and 
singlet valence, the non singlet and the two diagonal singlet splitting functions have been calculated in Ref. [4g| using 

(2) (2) 

this continuation. Only the off-diagonal splitting functions P^J and P^^ need to be calculated before a full NNLO 
timelike evolution of all FFs will be possible. The resulting reduction in the theoretical error on all calculations when 
upgraded from NLO to NNLO should lead to a reduction on the total errors on FFs obtained in global fits, although 
this reduction will be generally somewhat smaller than the current experimental errors on FFs. 

Using the momentum sum rule of eq. (j23p in eq. ([8]) with N = 2 and all hadron species h summed over gives a 
constraint on the splitting functions in the singlet and gluon sector: 

dxxi2P^''^ix) + P^^^{x)) = f dxxi2P^^l\x) + P^^\x)) = 0. (25) 
Jo Jo 

The factors of 2 account for the identical contributions of quarks and antiquarks. Similar momentum sum rule 
constraints exist for the spacelike splitting functions after the interchange Psg ^ Pgs is made. Similarly, the charge 
sum rule of eq. ((24)) and eq. ([8]) with = 1 implies that the valence quark splitting functions obey 

/' '^^^A.NS (^) = '^^^A.E (^) = 0, (26) 

Jo Jo 

which are similar to the valence sum rule constraints on the spacelike splitting functions. 



9. The simplest case: e^e ^ h -\- X 



We are now in a position to highlight the main features of the perturbative calculation of the above process, 
which serves as a simple illustration of the calculation of factorized cross sections in general. In this subsection we 
assume for simplicity that there are only Uf flavours of quarks, which are all massless. This process proceeds via 
e^e~ —fj*,Z—>qj + qj^h-\-X. For clarity, we will neglect the Z boson for now and discuss the modifications 
due to its effects later. Furthermore, the (anti)quark qj (qj) at the electroweak vertex, called the primary quark, of 
specific flavour J is often tagged in experiment, and we will assume for generality that this is the case. The process 
is calculated by factorization of the modulus squared of the diagram in Fig. ([T]) (left), in which the resulting partonic 
kinematics are also shown (right). 

We work to LO in electroweak theory. Taking da'^{x,Ef) da'l {x,s) and da^{z, . . . 



where qj is the tagged quark, and then making the replacement z 
appear, the cross section can be written 



^ '^^qji^^ . . .) in eq. H]), 
x/z to ensure only differentials in z and not x/ z 



dx 



r T "?) ^;v,„ (? M?) + ^ I ^ (=- »i) (?■ 



H-^(..,.Mj)Z,;(iMj) 



(27) 



up to higher twist terms of 0(Aqcd/\/s) or less. Each partonic cross section may be written 

for X = NS, PS and g. 



{z,s,M]) ^ ao{s)N,Q^,,{s)Cx U,a,(s),ln— ^ 



dz 



(28) 
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The quantity ctq = 47rQf^/(3s) is the leading order (LO) cross section for the process e+e fj,~^ 



where 



- fi , and the 
is the electric 



coupling of the tagged quark qj at the photon vertex is accounted for by Qqj{s) -^e-qj^ - 
charge of the quark qj and Cg that of the electron/positron. Note that Qgj becomes dependent on s when the effects 
of the Z boson, which will be discussed later, are included. The Cx are the coefficient functions whose NLO [i^l and 
NNLO ^4^1 terms are known. For the choice M| = s, the Cx{z, a^, 0) = Cx{z, cls) are given to NLO by 



Cns(-z, as) ^6(1 - z)+ ttsCp 

"ln(l - z) 



2^ 9\ 3 



1 - z 



,2n 



1 - Z 



+ l + 2-^lnz + |(l-z) 
1 — z 2 



Cps(z, fls) =0(aj and 

r 1 + fl - z)2 



(ln(l - z) + 21nz) 



Note that the pure singlet contribution only enters at NNLO. The coefficient functions in the case where the masses 
of partonic heavy quark are not neglected and the remaining heavy quarks are not decoupled (i.e. their masses are 
not set to infinity) are presented in Ref. j24| . 

The non singlet partonic cross section da^f /dz contains only and all those contributions in which the "detected" 
(fragmenting) quark qj/qj is part of the same quark line as that for the tagged qj/qj connected to the electroweak 
vertex. The pure singlet partonic cross section da^f /dz contains all other contributions, i.e. those for which the tagged 
qj/qj that goes through the electroweak vertex is not part of the same quark line as the quark which fragments. It 
is independent of the flavour of the "detected" quark and gives the same result when this quark is replaced by an 
antiquark. Finally, da^Jdz contains all contributions in which the "detected" parton is a gluon. 

In terms of singlets and non singlets. 



-7^(^,^) 

dx 



1 dz 

z 



dz 



^|.(...,Mj)Z,;(iMj 



da^„ 



dz 



Mi 



(29) 



where the sing let and non singlets D^^ j^g are defined in eqs. (|16p and (|18p respectively, and the singlet partonic 
cross sections 



dal 



(30) 



dz dz dz 

The full untagged cross section can always be obtained by summing the tagged quarks in eq. (I27p over all flavours. 



dx 



(x,s) 



dx 



'x,s). 



(31) 



Conversely, eq. (|?f)) may be obtained from eq. (PT|) simply by setting all Qq^ — except Qq, . Thus the tagged cross 
sections daq^/dx are physical at least in the sense that they are factorization scheme and scale independent. 



Now including the effects of the Z boson, i.e. for all processes 



1J + Qj 



dcr'} da'l, y.^ da'' p 

-^{x, s) = — ^(x, s) + -^{x, s). 
dx dx dx 



(32) 



where da^. z-qjl^^ contains the contributions from all processes which couple to the virtual photon and which couple 
to the Z boson in the same way as to the virtual photon. It is therefore equal to the da^Jdx with virtual photon 
effects only but with the electroweak coupling Qqj (s) modified to [iot 



Qqj — ^e^qj ^~ '^^e'^e^qj'^ 



s{s - M|) 
''■^(s-M|)2 + M|r| 



(s-M|)2 + Af|r|^ 



(33) 
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where the decay width of the Z boson Y z = 
/ to the Z boson are given respectively by 



2.4952 GeV [49[ and the vector and axial-vector couplings of a fermion 
2e^ sin'^ 



__3J 

2 sin 0w cos 0y\/ 
2 sin 9w cos ' 



and 



(34) 



with T3 J the third component of weak isospin of the fermion's left-handed component and 9w the electroweak mixing 
angle. The cross sections da^^ p/dx account for all other processes, and contain contributions from Z boson exchange 
with purely axial coupling to the quarks at the Z boson vertex [50l |. Its form after factorization is the same as the 
form of da^Jdx in eq. ([27|l . i.e. 



dx 



{x,s) 



^ L ' 1=1 



dz 



where 



X,F 



' Mf 

z,as(s), In 

s 



(35) 



(36) 



with the electroweak coupling 



2\ 2 



s{s - Ml) 



M|)2 + M|r|' 



(37) 



In the untagged cross section, the replacement a^^ a^j ^g/ must be made in eq. (I37|) in order to account for 

interference effects between diagrams with different quark flavours at the Z boson vertex. Since a^^ = —a^^, where 
=Sti and J2d is any up and down type quark respectively, it is the non zero mass differences within each generation that 
are responsible for these types of contributions. The coefficient functions Cf,nS: Cf,ps smd CF,g are all proportional 
to CpTii, and so must vanish for photons due to Furry's theorem [5l|. The series for Cf,ns begins at 0{al), while 
the series for Cf,ps and Cf.q begin at higher order, i.e. dtr^^ p/dx contributes at NNLO and so is neglected in NLO 
calculations. 

In a single event in e+e^ -^"f*,Z^qj + qj^h + X, the number of hadrons of species h produced with energy 
or momentum fraction between x and a; -I- c?x is 



N^(^,s)dx^^^{x,sl 



a{s) dx 



where, choosing /i = y/s, the total cross section 



cr{s) = ^aQ{s)NcQg,{s) (^l + ^Cpasis) 



(38) 



(39) 



From this result we can obtain two important sum rules. Firstly, by energy or momentum conservation, the total 
energy of the final state, J2h j lo '^^^qj s)Eh: must equal the energy of the initial state, -/s. Using x — 2Eh/ 
this equality of energies is equivalent to 







1 da^^ ^ ^ 
dxx — i^yx, s), 
dx 



(40) 



where Uqj (s) is the total cross section when quark qj is tagged, given by eq. (j39p with all Qq^ set to zero except that 
for I = J. In the MS scheme, the momentum sum rule of eq. and the non singlet, singlet and gluon coefficient 
functions for N = 2 also imply eq. (|40)) . The total electric charge of the final state when quark qj is tagged is 



E 



1 



dx Ch , . , 
(J[s) ax 



(41) 
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which follows from eq. (p4)) . 

The cross section that we have been considering so far in this subsection can be decomposed into cross sections 
in which the vector boson is transversely and longitudinally polarized with respect to the direction of the detected 
hadron, denoted by the subscripts T and L respectively: 



dx 



dx 



qj -.i^ 
dx 



(42) 



As usual, da^j rp/dx and da^^^ lI '^^ ^"^^ factorized in the same way as dcTq ^ I dx in eq. (I27|) . The corresponding coefficient 
functions are defined as before via 



dz 

To 0(a,), 



(z, s, MJ) = cro(s)(5g, (s)C^,x as(s), In ■ 



for X ^ NS, PS and 5, and for 77 = T and L. 



(43) 



CL^Ps(2,as) -0(a2) 



and 

A- z 



(44) 



Therefore, these coefficient functions up to and including the 0(af) terms are required for a NLO calculation of 
du^^ 1^1 dx. This means that the LO part of the longitudinal component of dcr^^ p/dx, being of 0(0^), is required for 
consistency in a NLO calculation of da^^ h/dx- Note that the transverse coefficient functions Ct,x — Cx — Cl.x- 

The asymmetric cross section da^, aI which is due to parity-violating effects of the Z boson, appears in the 
differential cross section [13, El HI 



<1„h 



dxd cos 9 



3(i + eos2 0)^ + ^in2 
8 dx 4 



dx 



- cos 
4 



dx 



(45) 



where 6 is the scattering angle of the produced hadron. From a standard tensor analysis, eq. (|45p is the most general 
form for inclusive single hadron production from a spin 1 vector boson. Note that eq. (|42p is reproduced on integrating 
over cos9. Unlike du^^ rp/dx, da^^ ^/dx and dcrj^ p/dx, the factorized dcr^^ A/d^ depends only on the valence quark 
FFs: 



dx 



(x,s) 



^ dz da, 



qj.A 



dz 



{z,s,M])Dl^^,^^(-Mf 



Here, the partonic cross sections are given by 



dz 



(z, s, M2) = c7ois)Ql {s)Ca z, a,(s). In ■ 



(46) 



(47) 



where the electroweak coupling 

QqM = 

and, to NLO, 



2aeaqj 



s{s-Ml) 



\s-MlY + MlTl 



2VeVqj 



(48) 



CA{z,as) = CT,miz,as) - asCpil ~ z). 



(49) 



3. GLOBAL FITTING OF FRAGMENTATION FUNCTIONS FROM e+e" REACTION DATA 

A comprehensive review of measurements of inclusive single hadron production in e+e^ reactions, e+e^ ^ 7*, Z — > 
h + X, up to the year 1995 is given in Ref. [53], and, to the best of the author's knowledge, all data to the present 
day can be obtained in numerical form from Ref. ^54]. Usually, the normalized cross section 

F^Jx,s)^^^^^^:^^ (50) 
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is measured, where Sa is the set of all tagged quarks. Equation (140]) requires the normalization of this cross section 
to be such that 



(51) 



At LO, the results of subsection 12.91 give 



T QojD'' (x,s) 
^.^(^^^)^Z,^,e^;A^^i^,^^ (52) 

i.e. the measured cross section is essentially an FF or a charge-weighted sum of FFs. In practice, cross sections are 
measured in an x bin of finite width. Writing the range of the bin as a;; < x < Xh^ such cross sections must be 
calculated as 

{FlJixi,Xh, s) - f ' dxF^Jx, s). (53) 

Xh-Xi 7^, 

Fortunately, by working in Mellin space this bin averaging can be calculated analytically: From eq. (jCSp 

{F^J{xi,x,,s) = — / dN ^ ; J^ljiV,.). (54) 

Xh -xi2Tn Jc I - N 

The large amount of accurate data for inclusive single light charged and neutral hadron production from these 
reactions, in particular from LEP, has been used to accurately constrain many of the degrees of freedom for unpolarized 
charge-sign unidentified FFs for light hadrons through global fits. The experimental data sets from e^e" reactions 
relevant for present global fits of FFs for tt*, , vlv-, Kg and A/A particles are summarized in Tables [3] — [7l The 
normalization uncertainty common to all data points is also given and therefore, in order to be correctly implemented, 
should be treated separately from the statistical uncertainty which varies from data point to data point. The method 
to do this will be discussed in subsection 16.21 Measurements in which the quark at the electroweak boson vertex is 
identified, or "tagged", as either a light, & or c flavour quark have been performed by the TPC [HHl, DELPHI [sgI ] 
and SLD 57] collaborations, and as either a u, d, s, b or c flavour quark by the OPAL collaboration [5^, which 
allows FFs of quarks with the same electroweak couplings to be separately constrained, namely u and c quark FFs 
can be separated from one another, and d, s and b quark FFs can be separated from one another [59l |. The tagging 
probabilities 

Vt{x,s)^ f dxFl{x,s) = {l-x){F!;^){x,l,s) (55) 

J X 

have been measured [58| by the OPAL collaboration for qj ~ u, d, s, c and b individually, which in particular are 
the only data that give phenomenological constraints on the separation between the light quark flavours. These are 
the only data from e^e~ reactions which can separately constrain the d and s quark FFs. However, they are rather 
limited in number and/or accuracy, and in particular only exist for x > 0.2. In addition, the experimental definition 
of these measurements may not coincide with the theoretical definition in e^. 11551) 16011 . 

Global fits to data from e+e~ reactions have been performed in Refs. [lO, [filf, |62|. [63l. [6^ . [65l [6^ [67l . [bsI . [69l [toI . 
[tiI . Ft^ . pTSj via minimization of x^, which is typically achieved through the minimization program MINUIT (74| . 
Currently, all calculations are performed to NLO accuracy, and competitive phenomenological extractions of as{Mz) 
have simultaneously been performed in some of these fits. Other determinations of FFs 75, 76, 77] using theoretical 
constraints such as dimensional-counting rules for the large z behaviour [tI, [tI] or Monte Carlo [7^] , have used such 
data to motivate the values of any degrees of freedom in these approaches. Data for which the energy scale Eg is 
less than a few GeV are excluded to avoid higher twist effects, detected hadron mass effects, the unreliability of 
truncated perturbative series, and other effects beyond the standard FO approach that may be relevant at low Es- 
Usually, data for which x < 0.1, or even x < 0.05, are excluded from fits because small x logarithms in the FO 
calculation may prevent this region from being described, or because of other small x effects not accounted for in the 
calculations. Improvements to the theoretical description of this region will be discussed in section [51 Measurements 
of p/p production in e^e~ reactions are rather inaccurate, and therefore the much more accurate measurements of 
production of unidentified hadrons, when used together with measurements of tt^ and production and/or FFs for 
these two particles, would make a significant improvement to the constraints on FFs for p/p. However, the amount 
of contamination by charged particles other than the light charged hadrons in unidentified hadron data is unknown 
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but may be significant: As noted in Ref. [6l|, hadron unidentified data from the ALEPH [t^, [s^I and OPAL [sTj 
collaborations are inconsistent with similar data from DELPHI [5^ and SLD [s^l • Furthermore, including such data 
would require uniting the fits for each hadron species into a single fit, which is a greater computational challenge than 
performing these fits separately (although the fits for different hadron species may have to be united if as{Mz) is 
included in the list of parameters to be fitted). 

Longitudinal and transverse cross section measurements for hadron production in e^e^ reactions have provided 

accurate constraints on the summed FFs D'^ . The differences, namely the valence quark FFs D'l , could 

qi/qi ' ^ i^aqi/qi^ 

be constrained by measurements of the asymmetry cross section da'^^ aI'^^^ which has been performed at LEP [83 |. 
Unfortunately, no identification of the produced charged particle's species has been made in these asymmetry mea- 
surements, so that, as mentioned in section [Jl currently the only constraints on the valence quark FFs are provided 
by data from reactions at RHIC which are rather poor, and also by HERMES data for which low Eg — Q effects 
may be important. 

In global fits, the FFs are extracted at some "initial" or "starting" scale Mf = Mq. The FFs Di, which are usually 
taken to be d\^^ and the charge-sign asymmetry j^^"'^^'^ (which in the case that i is a quark are equal respectively 
to the summed and valence quark FFs), or D'^ and Z?^, are typically parameterized in the form 

D,{z,M^) = iV,z'^'(l - X /.(z), (56) 

and the parameters N^, Ui, bi, . . .are freed in the fit. The function fi{z), which was set to 1 in early fits, depends 
on further free parameters and is used to extend the function space available to the Di. The (1 — z)^ behaviour 
is motivated by dimensional counting rules [vsj . and is expected to set in at large enough Mj due to the large z 
behaviour of the splitting functions [8j| . The behaviour is chosen because of the behaviour of the evolution at 
small (but not too small) z, to be discussed in section [8l However, these physical arguments do not precisely follow 
from QCD and, furthermore, the choice of parameterization used in a fit only needs to provide a sufficient region of 
function space to the FFs for the data to be as well described as the approximation for the high energy theory allows. 
Usually, Mq is chosen to be below the lowest value of Eg of the data, but such that Mq ^ Aqcd in order for the 
perturbative calculation of the DGLAP evolution to still be valid. Typically Mq = 0{1) GeV. In principle, any value 
may be chosen. Even the choice Mq = O(Aqcd) may be justified since the resulting large theoretical errors in the 
evolution around this scale would effectively be absorbed into the parameters on fitting, i.e. these low scale effects 
essentially just modify the choice of parameterization, although then the choice of parameterization in eq. I|56p may 
not be suitable. 

These choices of parameterization for each FF are usually the strongest non perturbative theoretical constraint, 
if the other non perturbative constraints are exact or at least good approximations in the framework of current 
experimental and theoretical information. An example of such a reliable constraint is eq. (jl4p . which is exact in the 
Standard Model (in a physical scheme like MS). On the other hand, as noted for the generation of the asymmetry 
between strange quark and antiquark PDFs of the proton in Ref. [SJI, because PaoE ^^a^ns beyond NLO (see the 
discussion following eq. (j2ip ). the condition for unfavoured FFs to obey Z?^^ = D^^ for all Mf is only possible in 

certain cases. For example, neglecting electroweak effects, suppose we impose the constraint D^^ —D^^ — D'^ 
at Mf — Mq, which follows from SU(2) isospin symmetry between u and d. This symmetry is exact in the limit that 
the masses of these quarks are equal (and electroweak effects can be ignored), which is a reasonable assumption 
given that their masses are 1.5-4 MeV and 4-8 MeV respectively. Then, if the charge-sign symmetry constraint 
D^^ = D^^ for qi = s,c,b, ... is also chosen to hold at Mf = Mq, it will hold for all Mf, as will the isospin symmetry 
constraint above, because I?AcE and i'/^AcNSi which do not mix with any other FFs on evolution, vanish at Mf = Mq 
and therefore all Mf. In other words, the violation of SU(2) isospin is responsible for the asymmetry between the 
fragmentation to tt^ from an unfavoured quark and that from its antiquark. Therefore, these unfavoured asymmetries 
to 7r+ are expected to be larger than the unfavoured asymmetries to e.g. h = K~^. Note that electroweak effects only 
predict non-zero unfavoured asymmetries for tt"*" and K~^, but does not which are the largest. As for eqs. (|23p and 
this Mf independence of the isospin and charge-sign symmetry constraints implies that they may be regarded as 
"physical" , but only when taken together. Other than the constraints mentioned above, insufficient phenomenological 
constraints in global fits may require imposing yet further theoretical constraints between FFs which have no real 
physical justifications and which may not hold for all Mf. We will consider some examples below. 

If FFs for different hadron species are related by a simple symmetry, FFs for one of these hadron species can be 
used to make predictions for, or can be constrained by, processes in which another of these hadrons is produced. For 
example, SU(2) isospin symmetry implies 

n'^" — im* n^s — in-ff* and n"/" - dp^^ ('^7'] 

However, it should be noted here that some hadrons are not produced by direct partonic fragmentation, but rather 
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by decay from another hadron, which itself may have been produced either by partonic fragmentation or hadronic 
decay. If the decay channels involved in the production of e.g. tt^ and tt^ do not respect SU(2) isospin symmetry, the 
relation between their FFs above will be violated even if it is true for direct fragmentation. 

A valuable consequence of the second result in eq. (|57p arises [11]: For any initial state, the difference between 
da^ /2 and da'^s depends only on the FF {Dd - D^)^ /2 = {Du - Dd)^s^ which is a non singlet. Therefore, this 
FF is rather well constrained relative to the other FF components. This is similar to the consequence from charge 
conjugation symmetry that the difference between da^ and da'^~ depends only on the charge-sign asymmetry FFs, 
except for the absence of e~^e~ reaction data in that case. The assumption D^^^^ — ^D^^f^ ^ may be violated by 
those and K'^. arising from complicated decay channels involving other hadrons instead of from direct partonic 
fragmentation, which may differ between and Kg. However, SU(2) isospin suggests that these hadronic decay 
processes for should be similar to those for Kg. An indication of this is found in the AKK08 fit because the fitted 
masses of K^ and Kg differ from their true masses by the same amount, as will be discussed in subsection 17.31 Since 
the non singlet splitting functions and coefhcient functions for e^e~ reactions are known to NNLO, the non singlet 
{Du — Dd)^s can also be extracted, and through it as{Mz), to NNLO. Such a procedure would be similar to the 
NNLO extraction of the non singlet PDF fP — and as{Mz) using data from proton and deuteron targets 86], and 
would allow for a further test of perturbative stability in the timelike case. 

Aside from the rather limited measurements of tagging probabilities from OPAL [s^, defined in eq. ([55)1 . mea- 
surements at LEP, taken at center-of-mass (cm.) energies at the Z pole mass, have not implemented tagging of 
individual flavours of light quarks. Consequently, global fits to LEP data will not significantly constrain the dif- 
ferences between the individual light quark flavour FFs. If the OPAL tagging probabilities are not included in the 
fit, these differences will be completely unconstrained, leading to significant discrepancies between each of the light 
quark FFs from different sets such as BKK and Kretzer, as illustrated in Fig. [51 These discrepancies are constrained 




FIG. 2: Ratios of Kretzer (labeled "this fit") FFs to BKK FFs for 7r± (left) and /C± (right) at Mj = 100 GeV^ (where Mf is 
written as "Q"). From Ref. [13]. 

solely by theoretical constraints, which differ from one set to another. However, LEP data do constrain the sum of 
these FFs weighted with the respective electroweak couplings given by eq. ([33]) (neglecting effects beyond LO). In 
fact, because these electroweak couplings are approximately equal at the Z pole mass according eq. p3l) this sum is 
approximately proportional to the 3 flavour singlet quark FF [59|. This may explain why, despite the very different 
theoretical constraints on the KKP |6lj and Kretzer [gJ sets, the values for this FF from these sets are very similar 
in the fit range of x, except at large z because the data at large x are scarce (see Fig. [3]). Some lower energy data 
was also used in the extraction of these FF sets, although they are much lower in accuracy and number compared 
to the LEP data. Data over a range of cm. energies will provide some constraints on the difference between the u 
quark FF and the sum of the d and s quark FFs. This may explain why, in Fig. [2l the u quark FFs from BKK and 
Kretzer are similar, at least relative to the s quark FF, and also to the d quark FF for K^. However, because the 
electroweak couplings of the d and s quarks are equal at all energies, no untagged e~'"e~ reaction data can constrain 
the difference between their FFs. Therefore, since the OPAL tagging probabilities were not available at the time that 
these analyses were carried out, the difference between d and s quark FFs were constrained by fixing one of these two 
FFs in the fit. For example, in the KKP fit, the d quark FF for tt^ is fixed to the u quark one, which according to 
SU(2) isospin symmetry is a very good approximation. For K^, the s quark FF is fixed to the u quark FF, which 
would be a valid approximation if the s and u quark masses were similar. This is clearly not the case — the s quark 
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FIG. 3: As in Fig. for the ratios of the Kretzer (labeled "K") to the KKP values for the singlet = Dl + Dl + Df + 
of + Df + D^* at Mj = 100 GeV^ (left) and 2 GeV^ (right). From Ref. 



FF should be somewhat larger because the u {u) quark must form a bound state with a heavier s (s) quark from the 
vacuum 1] . This strangeness suppression, measured by the strangeness suppression factor , being the ratio of the 
production probabiUty from the hadronization sea of s quarks to that of u and d quarks, is indeed observed in the 
OPAL tagging probabilities \5§\. For p/p, the d quark FF is fixed to half the u quark FF merely to reflect the fact 
that there are more u than d quarks in the proton. This condition is chosen to hold at Mf = Mg, but cannot hold 
for all Mf, and is therefore not physical. 

To illustrate the reliability of the theoretical approach used in global fits (including the choice of FF parameterization 
and SU(2) isospin symmetry), we show in Fig. [4] the description, over a large range of cm. energies and using the 
KKP FF set, of tt* data from e~^e~ reactions, which provide stronger constraints on FFs for tt* than other data do 
on other FFs. The description of all data except the oldest data, from DASP, is very good in the range x > 0.1. 
Data below this range were excluded in the fit, which hence the deviation there from the data is no cause for concern. 
The fact that this deviation is so large may be due to neglected theoretical effects at small x such as unresummed 
logarithms or mass effects of the detected hadron. 



4. PREDICTIONS FROM GLOBALLY FITTED FRAGMENTATION FUNCTIONS 



The universality of FFs between processes with different initial states as implied by the factorization theorem allows 
data from one process to be described using FFs sufficiently constrained by data from another process, giving a test 
of theoretical and/or experimental results. Essentially, such a universality test involving predictions that have been 
measured would really be a test of whether all relevant physics effects have been accounted for in the calculations used 
in the global fits and in the calculations for the predictions, assuming experimental errors on FFs, to be discussed 
in section [SI have been propagated to these predictions. In other words, such fits will give more of a handle on 
the importance of contributions of supposedly negligible effects such as higher twist. In this section we discuss such 
programs, focusing mainly on descriptions of data from ep and pp reactions. 



1. ep reactions from HERA 



In this subsection we discuss inclusive single hadron production in neutral current DIS, ep —i- e + h + X or, omitting 
the purely leptonic subprocess and assuming the contribution from Z boson exchange is negligible, j*p h+X, whose 
relevant kinematics are given in Fig. [5] (left). The hard scale Eg of the process is provided by Q, where = —q^ > 
is the negative virtuality of the spacelike virtual photon 7*. Unlike e~^e~ reactions, ep reactions are complicated 
by the initial state hadron, which at leading twist contributes partons with densities given by PDFs to the hard 
interaction with the virtual photon, and contributes hadronic remnants to the final state. Because of the hardness 
of the collision, the hadronic final state separates into two clusters which are kinematically fairly distinguishable: 



21 




0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 



X 

FIG. 4: NLO (solid) and LO (dashed) predictions using the KKP FF set for inclusive tt* production at (from top to bottom) 
DASP [13, ARGUS m, TPC [H, TASSO ^ and SLD [13] at, respectively, = 5.2, 9.98, 29, 34 and 91.2 GeV. Each pair 
of curves is rescaled relative to the nearest pair above by a factor of 1/10. From Ref. |61( |. 




FIG. 5: Schematic illustration of factorization in the current fragmentation region for inclusive single hadron production in 
neutral current DIS ep —> e + h + X, giving all relevant kinematics including that of the initial {k') and final (fc) state real 
partons involved in the hard interaction. 



Those produced by fragmentation of a hard parton which must therefore be assigned to h, and those which are proton 
remnants contained in X. However, some hadrons cannot be unambiguously assi gne d to h or the proton remnants, 
which at the theoretical level translates into the need for fracture functions [olI . To^ l in the cross section to absorb 
additional potential mass singularities. Fracture functions describe the process p h + i + X . The detected hadron 
h in this process is a remnant of the initial proton, and the parton i is required to connect this process with the 
hard interaction. In other words, fracture functions describe the partonic structure of the initial hadron after it has 
produced the detected hadron. In this sense, a fracture function is both an FF and a PDF and depends on two 
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momentum fractions. Fracture functions are non perturbative and therefore, like FFs and PDFs, contribute unknown 
degrees of freedom to the cross section. Since on evolution they mix with both FFs and PDFs but not vice versa, one 
anticipates the existence of a scheme and scale independent cross section which does not depend on them. Indeed, 
fracture functions do not contribute to the cross section when the direction of the detected hadron's momentum is 
within the current fragmentation region: In the Breit frame, which is the frame in which the virtual photon's energy 
vanishes and its spatial momentum is antiparallel with the initial proton's (see Fig. [6|), this region is defined by 
9 < 7r/2, where 6 is the angle between the spatial momentum of the detected hadron and that of the virtual photon. 
Consequently, this region is essentially devoid of proton remnants, i.e. hadrons in this region are assigned to h and 

q=iO,Q) k'=(\Q-\Q) 

Before: y* 



After: ^ 




FIG. 6: Kinematics of the virtual photon and of the quark in the proton which it strikes, before and after this happens, in the 
Breit frame. Only the time and 3 components of momenta are given, in that order, the 1 and 2 components vanishing in all 
cases. At LO, all proton remnants move in the negative 3 direction, while the struck quark will fragment into a hadron moving 
in the positive 3 direction. At any order, most hadrons produced by fragmentation of a hard parton will go into the current 
fragmentation region, i.e. will have a positive 3 component of momentum. 



not X, which results in all non perturbative information in the calculation of the corresponding leading twist cross 
section being provided by the FFs and PDFs only. In terms of the Bjorken scaling variable Xb = / (2P • q) and the 
scaled detected hadron momentum 



'2ph ■ q 



g2 



(58) 



the cross section after the change of integration variables z x/z and w — > xb/w is 



Its schematic form is shown in Fig. O (right). For simplicity we have used the same value Mf for the factorization 
scale of the PDFs as that for the FFs. The PDF of parton i in a hadron h is written//', and dcr*-' is the cross section 
for the equivalent purely partonic processes j*i ^ j + X, which is known to NLO [43|. At LO it is given by 



da'^ ( , _ duo 

dzdwdQ^ T' ^' l?'""'^^ 7 ^ ' ^ 



where ctq is the cross section for the elastic process e/i e/i involving one photon exchange in the i-channel. 
The normalized cross section 

is measured in practice, where ddproton is the DIS (ep e + X) cross section. Some cancellation of PDF uncertainties 
between the numerator and denominator of eq. (|6ip should occur. In any case, these uncertainties are expected to 
be lower than the FF ones. The normalization of -Fproton '^^.y be obtained by following similar steps to those that 
lead to eq. (|5ip : Similar to eq. ([35]), the number of hadrons of species h produced with energy or momentum fraction 
between x and x-\-dx\s N^{x,Q'^)dx = [dx/{dapYoton/{dxBdQ^))]dapj.^^.^^/{dxdxBdQ^). The total momentum in the 

3 direction of the hadrons in the current fragmentation region is dxN^(x^ Q^)Ph, where ph is the 3 component 



23 



of the detected hadron's momentum. This must equal that of the struck quark, which according to Fig. [6] is Q/2 in 
the Breit frame. Therefore, since eq. (|58p imphes that x = 2ph/Q in this frame, J^pj-otonC^' cuts) has the normahzation 
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From the momentum sum rule, eq. (|23p . this gives 

rfcTproton , sr^ dw da"- 



dxBdQ- 



:{XB,Q')^J2 



J, w dwdQ"^ 



w, —,as{n ) ) f, 
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V W 



(62) 
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where 



da' 
dwdQ^ 



da'^ 



dzdwdQ"^ ' 



(64) 



which at LO is therefore 



da' 



_ dap 

dwdQ"^ dQ 



w 



(65) 



The region of the {xb,Q^) plane, written "cuts" in eq. (j6ip . is usually specified by experimentalists as cuts on the 
squared cm. energy of the virtual photon-proton system. 



W^ = {P + qf^Q^(—-l 

and the fraction of the energy of the initial electron/positron which is lost in the rest frame of the proton, 

P-q 



VB = 



P ■ k xbs' 



(66) 



(67) 



where y/s is the cm. energy of the initial ep system. Working in the laboratory frame, a lower bound on the scattered 
electron's energy 



E' = E-Q'^ 



E 1 

Ie^s ^ 4E J ' 



(68) 



where E is the energy of the initial electron/positron, is sometimes imposed to prevent the scattered electron/positron 
being falsely identified with isolated low energy deposits in the calorimeter while the true scattered electron/positron 
passes undetected down the beam pipe. The HI collaboration imposes additional cuts [Q^Iq^] on the angle of deflection 
of the electron/positron and struck parton in the laboratory frame, respectively 9e and Op, to maintain good detector 
acceptance. These are given in terms of and by 



and 



cos 9p 



cosf 



xbs (4^2 _ Q2^ _ 4^2g2 

a:Bs(4£;2 + Q2)_4£;2Q2 



XBsjxBS - Q^) - AE^Q^ 
XBs{xBS-Q^)+4E^Q^ 



(69) 



(70) 



The regions in the (xb,Q^) plane used by the HI collaboration in Ref. [9J| and by the ZEUS collaboration in Ref. 



[QSj are shown in Fig. [T] As for Fg^ in e~^e 



proton 



is usually averaged over a finite bin width in x. 



Conservation of energy and momentum implies that, after being struck by the virtual photon, the quark's spatial 
momentum changes in sign only, as illustrated by Fig. [B] Therefore, at LO, all hadrons produced in the current 
fragmentation region originate from fragmentation of the struck quark. (We note in passing that, at LO, hadrons 
produced in the remaining region 9 > 7r/2, the target fragmentation region (see Fig. [B]), can only be proton remnants, 
which theoretically means that fracture functions play a more important role than FFs (or PDFs) in this region, 
making this a perfect region for extracting fracture functions.) Therefore, measurements in the current fragmentation 
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FIG. 7: Cuts in the (a:;B,Q^) plane (where is written "x") used in the HI analysis of Ref. [H] (left) with E = 27.5 GeV 
and = 300.3 GeV, and in the ZEUS analysis of Ref. [H (right) with E = 26.7 GeV and = 296 GeV. The low and high 
regions used by HI are each indicated by the label "HI" and the region used by ZEUS by the label "ZEUS", being bound 
in each case by the nearest cut to the label. From Ref. [o^ . 



region can give good constraints on the FFs. The distribution of hadrons in this region is very similar to that in any 
one of the two event hemispheres in e+e~ reactions with ^/s = Q, where a hemisphere is defined to be the union of 
all directions that make an angle less than 7r/2 with the thrust axis of the hadron distribution of the event, which at 
LO is aligned with either the primary quark or antiquark in the case of e^e~ reactions. In mathematical terms, if 
the range in is negligible, which is usually a good approximation since i^proton is approximately independent of Q 
up to 0(1/ In Q) corrections, the LO calculation 



^p,,,„„(x,cutsi - ^„^^ el{G,,,m + G,,m) ' ^ ^ 



is similar to that for e^e~ in eq. (|52p with = Q and tagged quarks summed over all flavours (and with Qgj — > e^^ 
because the contribution to the ep reaction cross section from Z boson exchange has not yet been calculated) except 
for the presence of the PDF dependent factors Gqj{Q'^) = J^^^^ dxB fqj°^™^{xB,Q^)- Because of the variation among 
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the Gqj, and assuming that SU(2) isospin symmetry and charge-sign symmetry of the initial proton's sea are poor 
approximations, the separation of the different quark flavour FFs, and also the valence quark FFs if the charge-sign of 
the produced hadron in ep reactions is identified, can be constrained by suitable data from both e+e~ and ep reaction 
data, or even by ep data alone by choosing different regions in the {xb, Q^) plane for "cuts" . Recall that untagged data 
from e^e~ reactions can constrain neither the separation between FFs of quarks of the same electroweak couplings, 
nor, in the case of transverse and longitudinal cross sections, the valence quark FFs. 

The ratios of the various tagged cross sections to the total cross section, for both ep and e^e^ reactions, are shown 
in Fig. [5] using the AKK 69] FF set. Calculations for ep reactions are also performed using the KKP and Kretzer FF 




FIG. 8: Left: The ratios of the quark tagged components of the cross section to the untagged cross section for the high Q 
HI data, using the AKK, KKP and Kretzer FF sets. The lowest 3 curves show the contribution from the u quark tagged 
component only, the next 3 curves above the sum of the u and d components, the next ?> u, d and s etc. Right: The ratios 
of the quark tagged components of the e^e~ — > /i* -|- X cross section, where is any light charged hadron, to the untagged 
cross section, at y/s = 91.2 GeV and using the AKK FF set. Both plots, as well as the plots in Figs. [9] — 1121 have been taken 
from Ref. [96l |. and, in all these plots, x is written "Xp". 

sets. As for e+e^ reactions, quark tagging for ep reactions is performed by setting to zero the electroweak coupling 
of all quark flavours except that of the tagged quark flavour, and is therefore "physical" in the sense of being scheme 
and scale independent. However, quark tagging in ep reactions may not be possible at least in the near future. In any 
case, these plots highlight the relative differences in importance of the contributions to the overall production from 
the fragmentations of the various quarks in both reactions. In particular, u quark fragmentation proves to be far more 
important in ep reactions while h quark fragmentation is far more important in e^e~ reactions, as expected from 
the relative magnitudes of the proton PDFs for these quarks. According to Fig. [51 the relative yields of the various 
hadron species are similar. Thus the complementary information on fragmentation between e"'"e~ and ep reactions 
should also apply at the level of hadron identification, which is of primary interest at present. 

Data from ep reactions are no better than data from e+e~ reactions at constraining the gluon FF, since it again 
only enters at NLO according to eq. (j7ip . The most significant constraints on gluon fragmentation come at present 
frompp(p) reactions, to be discussed in subsection 14.21 

Comparisons have been made in Ref. [1^ with data from the HI [o^ and ZEUS [1^, [13] experiments using the 
CYCLOPS program the CTEQ6M PDF set [99| and the AKK, Kretzer and KKP FF sets. The comparison with 
the HI data at low and high ranges is shown in Fig.fTUl The strong disagreement between the calculations from the 
different FF sets at large x most likely arises from large experimental errors on the FFs due to poor constraints from 
e"'"e~ reaction data at large momentum fraction. Otherwise, the calculations are fairly independent of the FF set used 
despite very different theoretical constraints on the FFs among the different sets. In other words, any dependence of 
the cross section on those FF components not well constrained by e^e~ reaction data which may arise as a consequence 
of the differences among the Gq j factors in eq. ([7T|) is in fact negligible, and/or the theoretical constraints on the FFs 
in the case of the KKP and Kretzer sets and the OPAL tagging probabilities in the case of the AKK set are sufficiently 
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FIG. 9: The ratios of the individual hadron species constituting the sample for the high Q HI data, using the AKK, Kretzer 
and KKP FF sets (left), and the ratios of the individual hadron species constituting the sample for the e^e~ + X cross 

section, where is any light charged hadron, to the cross section for the full sample, at ^/s — 91.2 GeV and using the AKK 
FF set. 
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FIG. 10: Comparisons of theoretical predictions using the AKK, Kretzer and KKP FF sets with the x distributions from HI 



reliable. At high Q^, the calculation for all 3 FF sets agrees well with the 
small X values and, perhaps, at large x values found with the lower data 
the FO approach at leading twist. For example, resummation of soft gluon 
at small and large x may be necessary to improve the calculation here. This 
scale variation on the calculation, being largest at small and large x and for 
are found to some degree in the comparison with the ZEUS data of Ref. [9 
the predictions with one another is largest around x = 0.3. The description 
is generally good, but above this range it fails for < 100 GeV^, which a; 
large x in the calculation. However, the scale variation in this region is small 
is stable here. 



data. Therefore, the disagreements at 
may be due to neglected effects beyond 
emission logarithms that become large 
is illustrated in Fig. [TT]by the effect of 
the lower range. These observations 
"I in Fig. [T^ although disagreement of 
of the data in the range 0.3 < a; < 0.5 
.gain may be due to neglected effects at 
suggesting that the perturbative series 
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FIG. 11: As in Fig. 1101 using only the AKK FF set and showing the modifications arising from scale variation. 



Finally, we show the comparisons with the new data from the HI [lOOl | and ZEUS [101] collaborations in Fig. 
[T3l The HI collaboration reports using a sample which is a factor of 10 larger than that for the HI data discussed 
above to extract these data, as well as a better understanding of the experimental uncertainties, while the ZEUS 
collaboration reports an integrated luminosity of 0.5 fb~^ to be compared with 0.55 pb~^ in the older ZEUS data of 
Ref. [ill considered above. The description of these much more accurate data is much worse compared to that of the 
previous HERA data discussed above, and thus may require better constrained EEs and/or further improvements in 
the theory. One possible improvement for the description of the ZEUS data at large may be the inclusion of Z 
boson effects. 

In general, various low effects that have been neglected in these calculations may be important, particularly 
higher twist, heavy quarks and the mass of the hadron. The effect of the latter will be considered in subsection 15.11 



2. Hadron-hadron reactions 



Inclusive production of single hadrons in hadron-hadron reactions are important because they can verify and improve 
constraints on the charge-sign and flavour separation of quark FFs provided by ep and e^e" reactions. Perhaps most 
importantly, data from reactions at RHIC should constrain gluon EEs significantly better than data from ep and 
e+e" reactions can, owing to the occurrence of the gluon EE at LO in the calculation. 

The (dimensionless) quantity describing the inclusive single hadron production in the collision of 2 hadrons h\ and 
/i2, h\hi h + X , that is measured in experiment is 

- .^£%^(PT.„,») = s^^f^(„.y.,). (72) 

(exploiting azimuthal symmetry in the second equality), where ^/s is the cm. energy, where E and p the energy and 
spatial momentum respectively of the detected hadron h, where pt its transverse momentum relative to the spatial 
momenta of hi and /12, which are antiparallel, where the rapidity 

.= ^lnf^, (73) 
2 E -PL 

with PL the longitudinal momentum (in the direction of the spatial momentum of hi) of h, and where x is the scaling 
variable, i.e. in the cm. frame it is given by 

X = coshy ^ (74) 

We may also write 

a; = 1 - U -I- VW, (75) 
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FIG. 12: Comparisons of theoretical predictions using the AKK, Kretzer and KKP FF sets with the ZEUS data [93] (left) 
and showing the modifications arising from scale variation using only the AKK FF set (right). Each data set is measured in a 
specific x-bin and, together with its predictions, is shifted upwards relative to the one below by the indicated value for A. 



where V and W are the variables typicaUy used in perturbative calculations, related to the usual Mandelstam variables 
S, T and U ofh through 

T 

(76) 

W = . 

S + T 
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As for all inclusive single hadron production processes, the cross section at leading twist factorizes according to 

Fl^^_{x,y,s)^Y.jy'Pkh. (^^,y,j^,asiMj)^D':{z,M]), (77) 

where F^^^j^^ is the equivalent partonic production cross section. This equation has the following physical interpretation: 
In any frame related to the cm. frame via a boost (anti-)parallel to the beam direction (for massless hadrons the 
precise choice of frame is irrelevant in the study below — later when we treat the mass effects of the produced hadron 
we will need to specify a frame), the reaction of hi with /12 results in the inclusive production of a parton i, which 
subsequently fragments to a hadron moving in the same direction and carrying away a fraction z of the parton's 
momentum. Note that the partonic rapidity is the same as the hadronic rapidity, since for massless hadrons y can be 
approximated by the pseudorapidity 



7? = - In (^tan - j , (78) 

where 9 is the angle which both the produced hadron and the massless fragmenting parton make with the beam in 
the cm. frame. 

The quantity -F/j^^j^ C3) depends on the non perturbative initial state hadrons hi, i — 1,2, through their 

PDFs according to 

h,.,^,«.(M|)) J\xi j\x.FU 



(79) 



where M^, of which F^_^f^^ is formally independent, is the factorization scale associated with hadron hk- This result, 
which also follows from the factorization theorem, can be interpreted as the inclusive production of parton i from the 
interaction of a parton ik from one initial state hadron hk, where k = 1,2, moving parallel to it and carrying away 
a momentum fraction Xk, with a parton from the other. The perturbative ly calcula ble cross section F^_^^^ describes 
the purely partonic process 1112 i + X , and has been calculated to NLO [l02l llOSj . It depends on the cm. energy 
of the partonic system 1112, given by the square root of X1X2S. Since it is not evaluated in the cm. frame of this 
partonic process, in order to make connection with the cm. frame of the overall hadronic process it must therefore 
depend also on the ratio xi/x2, which determines the Lorentz transformations between the hadronic and partonic 
cm. frames (see eq. (|104l) and the discussion before it). 

The components of Fl_^^^ with i — g are of 0(0^), while the rest are either of this order or higher. Therefore the 
gluon FF appears at LO in this cross section, in contrast to ep and e"'" e~ re actions, and therefore pp reactions can 
provide NLO constraints on the gluon. However, large NLO corrections |l03j | suggest that perturbative instability is 
a large source of error. Indeed, the NLO cross section suffers a large scale variation, as can be seen for instance in 
Fig. 1201 This theoretical error is much greater than that coming from the propagated PDF uncertainty. 

As for ep reactions, the detected hadron h in e.g. hih2 reactions will sometimes be a soft remnant from one of the 
initial state hadrons instead of being produced by the hard partonic processes iiZ2 —^i + X followed by i h + X. 
As for ep reactions, these production channels are accounted for by fracture functions for each initial state hadron 
[9ll |. No further non perturbative input is required since it is clear from which initial hadron a remnant hadron was 
produced. Mathematically, all potential mass singularities which cannot be absorbed into PDFs and FFs can be 
absorbed into fracture functions. As a result of universality, they are process independent (apart from the initial state 
hadron(s)), so that the factorized fracture functions in hih2 reactions, including the potential mass singularities that 
they absorb, are identical to those in ep reactions when the same factorization scheme is used. As for ep reactions, one 
anticipates the existence of a scheme and scale independent cross section for /11/12 — > h + X which does not depend 
on fracture functions, since the evolution of FFs and PDFs does not depend on them, i.e. a "current fragmentation 
region" analogous to that in ep reactions which is free of initial hadron remnants. Certainly, contributions from target 
fr agme ntation to the cross section f^^^^^ that we have been considering will decrease with increasing pT and decreasing 



104| , because the detected hadron gets further away from the beam. Removal of these target fragmentation effects 
altogether may be possible by placing kinematic restrictions on processes with alternative fina l states for inclusive 
single hadron production, such as the semi-inclusive Drell-Yan process hih2 ^ + h + X [l05l|- 

As discussed in the beginning of subsection l2.61 charge-sign unidentified cross sections F'^ = F'^ + F^ (omitting 
the /ii/i2 subscript appearing in e.g. eq. (|72p for brevity) depend only on charge-sign unidentified FFs, and charge- 
sign asymmetry cross sections F^"'^ — F'^ — F^ depend only on charge-sign asymmetry FFs. In both types of 
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observable, the contributions to the production from the fragmentations of the various partons in the proton can be 
studied in a "physical" way, imposing tests on FFs through expectations for these contributions: Now restoring the 
hih2 subscript and omitting the h"^ superscript, we decompose the charge-sign unidentified cross section according to 

^PP — i-^PP ^pp^\ujj {-^pp ^pp^ldv ^ i.-^pp ^ -^PP -^pp)' (^^) 

Although Fpp = Fpp, we have not made this replacement here in order to emphasize that the final states differ, by 
the interchange /i+ ^ h~ . The presence of e.g. "|m^," means that the valence d quark PDF is set to zero, the result 
in Mellin space being that the first (second) term is proportional to the square of the u (d) valence quark PDF. Note 
that these asymmetry generating terms neither depend on the protons' sea partons nor receive contributions from 
interactions between valence u and d quarks. The absence of the latter interactions follows if the cross sections for the 
processes ud ^ + X and ud ^ + X are identical, which holds at NLO. Because the first and second term are 
each scheme and scale independent, they give a physical quantification of the contribution to the overall production 
from the fragmentations of the initial proton's u and d valence quarks respectively. These quarks are the source of 
the charge-sign asymmetry, to be discussed in more detail around eq. (|82p below. Since there are more valence u 
than d quarks in the initial protons, the first term is expected to dominate over the second for tt^ production (see 
the first plot in Fig. [T4|) . since the u and d quark fragmentations are equal, and even more so for p/p production (the 
third plot in Fig. [T3]), because then u is larger than d quark fragmentation . For K"^ (the second plot in Fig. [Tl)) . 
d quark fragmentation is unfavoured, and therefore the contribution from the protons' valence d quarks is expected 
to be much smaller than from their valence u quarks. The third term in eq. (|80p corresponds to the charge-sign 
symmetric contribution, becoming equal to Fpp when the protons' valence quarks vanish. It can be regarded as a 
physical quantification of the contribution to the production from the fragmentation of the collective sea of the initial 
protons. This term does not contribute to the charge-sign asymmetry cross section (to be discussed just now) because 
it is charge conjugation invariant. Therefore, the larger the third term is relative to the first two, the smaller the 
charge-sign asymmetry cross section is expected to be relative to the charge-sign unidentified cross section of eq. ((M)) . 
For the production of hadrons which have non-zero strangeness, or which are superpositions of hadrons with non-zero 
strangeness, which in our set are and A/A in the first case and Kg in the second, the third term is expected to 
dominate: Here, the favoured s quark and u and/or d quark fragmentation from the abundant sea occurs, while in 
the first two terms only the fragmentation from the protons' valence u and/or d quark contributes which necessarily 
involves the production of a heavier s quark. This behaviour is seen in the third plot of Fig. 1141 For tt^ and p/p 
production, for which u and d quark fragmentations are favoured, it is not clear whether the first two terms are more 
important due to the FFs there being an order of magnitude larger than the rest, or the third which accounts for 
fragmentation from the protons' abundant partonic sea. In practice, it turns out that the fragmentation from the 
initial protons' sea partons (the third term) always dominates, even if the charge-sign asymmetry is very significant. 
In fact the plots in Fig. [14] were produced using the unphysical decomposition 

ph± ^ F'^-D't +F'^^D't + F'of, (81) 

i=g,q3 

where in both eq. ((8T|) and ([82)1 we omit arguments, integration signs etc. for brevity, define F'^^' — F'^ ~ F^ and use 
the label qg to refer to sea quarks, since the results are qualitatively similar to those obtained with eq. (j80p . except 
that the third term in eq. ([81]) is much smaller than the third term in eq. ([80]) and therefore gives clearer plots. 
The charge-sign asymmetry is determined from the FFs according to 

pAJi^ ^ pu^ jjA^h^ ^ pd^ jjA^h^ _ (-g2) 

Both terms are factorization scheme and scale independent. Note from the quark composition of tt^ that D^"^ is 

expected to be negative and D^"'^^ positive (this should at least be true for their first moments). In addition, the 
quark composition of the proton suggest that the production of u over u is greater than the production of d over d, 
i.e. i^"" > F'^^ > 0. The first plot in Fig. [15] is consistent with these observations. This allows for the excess of 7r+ 
over TT^, but it should be noted that this excess is not guaranteed unless i^"" is sufficiently greater than F'^^ , or the 
magnitude of Of"^ is sufficiently smaller than D^"'" . For = p/p, all 4 quantities in eq. ([82]) are expected to 
be positive so that a definite excess of p over p is predicted. Because of the expectations F^^ > F'^" and, from the 
quark composition of p/p, Du"^^^ > of"^^^, the first term in eq. ([82]) is expected to dominate over the second. These 
expectations are observed in the second plot of Fig. [15] Likewise, an excess of K'^ over is predicted, although 
in this case the second term in eq. ([82]) vanishes since D^"^ = 0. Note that there is no dependence on Df"^ , so 
that this FF is not constrained by pp(j>) reaction measurements. 
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FIG. 14: Contributions to the production from fragmentation of the initial state protons' valence quarks and sea partons for 
BRAHMS kinematics with 2.9 < y < 3, using the AKK08 FF sets. The notation "u — u" refers to the contribution from u 
valence fragmentation FuD'^ — FuD'^ = F^^ (see eq. ((8l|), etc. The ratios are stacked, i.e. for a given pT value, the 
distance on the j/-axis from zero to the first curve gives the u — u contribution, from the first to the second curve the d — d 
contribution, and from the second curve to 1 the sea parton contribution. From Ref. [t^ . 
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FIG. 15: Contributions to the charge-sign asymmetry Ac/i^ from fragmentation of the initial state protons' valence quarks for 
BRAHMS kinematics for which 2.9 < y < 3, using the AKK08 FF sets. The notation + u" means FuD^"''^ + FuD^"''^ = 

cross section is given by the difference between the two curves. From Ref. [73| . 



3. Other processes 



Further tests of universality have been performed in Ref. [10a | by comparing calculations using the KKP FF set 
obtained by these authors with measurements of unidentified hadron production from pp reactions at the SPS by the 
UAl, UA2 collaborations and at the Tevatron by the CDF collaboration, and from photoproduction (requiring photon 
PDFs) in jp reactions from HI and ZEUS at HERA and in 77 reactions from OPAL at LEP2. The data overall 
were measured over a wide range of and y. Good agreement with all data was found provided that the theoretical 
error was taken to be the variation produced in the cross section when Mf/Eg is allowed to vary from 1/2 to 2. 
This cross section variation is rather large, and the experimental errors are unfortunately largest at large pt where 
additional non-perturbative information such as higher twist is expected to be least important. The photoproduction 
calculations also acquire large errors from the rather badly constrained parton distribution functions (PDFs) of the 
photon. Distributi ons in y genera lly have even larger theoretical errors. More stringent tests of universality were 
performed in Refs. [TotI. flM fioi through analysis of pseudorapidity and px distributions from HI for the process 



ep — > e + tt" + X , and in Ref. jl07l | through analysis of pt distributions from ZEUS for the process ep ^ e + h + X, 
which did not require the use of pho ton PDFs (except in the low Q region lllCi). The disagreement found with the 
ZEUS data was reduced in Ref. through resummation of multiple parton radiation at low pT ■ 
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5. IMPROVEMENTS TO THE STANDARD APPROACH 



The standard FO calculation discussed so far describes much of the data well, but is incomplete. The FO approx- 
imation of the high energy parts of the factorized cross section is only applicable to the kinematic regions for which 
Eg ^ Aqcd and x is not too close to or 1, and the boundaries of this acceptable region are unknown. Terms 
originating from the emission of soft gluons become large for large and small x and low Eg, and must be summed to 
all orders by a procedure known as resummation. We will discuss the well known procedure of large x resummation 
in DGLAP evolution and in e~^e~ reactions in subsection 15.21 and leave the resummation of soft gluon logarithms at 
small X to section IS] Only the ZM-VFNS has been implemented in global fits so far, whose error of 0(1) whenever 
Eg = 0{mi) can be removed by using instead the GM-VFNS. Even with these improvements to the perturbative 
approximation, the definition of the cross section we have been considering so far itself requires various formal modifi- 
cations, which usually become increasingly important with decreasing Eg- The effects of the produced hadron's mass, 
of 0{mf^/ Eg), will be discussed in subsection 15.11 Higher twist effects, of 0(Aqcd/£'s) or less, remain unknown at 
present. 

As for eqs. ^ and ([5]), we omit parton labels from now on. These labels may refer to individual parton species, or 
any other complete, linearly independent basis of FFs. 



1. Hadron mass effects 



The effect of hadron mass is one of the many important effects beyond the standard FO calculation in the region of 
low Eg and small x. In fact, as will be discussed in section[71 the fitted hadron masses for tt^, p/p and A/A occurring 
in the calculation for the e+e^ reactions in global fits are approximately equal to their true values [72[, suggesting 
that it is the first effect to become relevant as Eg and x decrease. 

In this section, we derive the necessary modifications to calculations when the mass to^ of the produced hadron 
h is incorporated in inclusive single hadron production in e'^e~, ep and hadron-hadron reactions. The theoretical 
calculation of the cross section is explicitly independent of hadron mass provided that the hadronic scaling variable x 
is identified with the light cone scaling variables as dictated by the factorization theorem. However, measured cross 
sections are usually differential in other scaling variables such as the fractions of available energy and momentum 
taken away by the detected hadron. Thus, the effects of hadron mass are a multiplicative modification to the cross 
section, and a transformation of the scaling variable of the experiment to the light cone scaling variable x. Otherwise 
the perturbative calculations are not modified, although it should be noted that the modification factor just mentioned 
depends on the momenta of any initial state hadron's partons that take part in the hard interaction. The fact that 
hadron mass effects are most important for low values of the scaling variable x and of the energy scale Eg of the 
process will be seen in the explicit results. 

We begin with eq. ([T]), which is true regardless of whether the hadron mass is non zero, provided z is identified with 
the light cone fraction and x with the light cone scaling variable. The precise definition of z, and x where necessary, 
is given for the processes of interest below. We assume all quarks are massless, although the generalization to massive 
quarks is straightforward. 

e+e^ reactions 112] Using the light cone coordinates defined in the paragraph before eq. (|A6p . we work in a frame 
for which the virtual boson's momentum is given by 



and the hadron's momentum by 



which also serves as a definition of x. The momentum fraction z is defined by the relation to ph of the -I- component 
of each parton's momentum k which contribute at leading twist: 

k+ = (85) 

z 

Note that for massless partons, = 0. The condition fc^ = is always true. The scaling variable Xp defined before 
eq. ([1]) is given by the ratio of the spatial momentum of Ph, Ph ^ {p^ ^ Ph)/^' ^o y/s/2, and so 

., = .(l-^). (86) 
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This can be inverted to give 




(87) 



Consequently, the experimental cross section da'^ /dxp, for example, can be calculated according to 



■ . dx , , da^ , , , , 1 1 1 



Ami 



da^ 

dx 



{x{xp,s),s) 



where, from eq. Ilj with the transformation z ^ xj z made for convenience, 

dcr^ , , dz da^ , ^^o^^h/x ,^ 



dx 



(89) 



is just the usual explicitly hadron mass independent calculation discussed in detail in subsection 12.91 for example eq 



27]) . The formulae above with xe, defined before eq. llJ, in place of Xp are the same but with 

As Fig. [TBI shows, the effect of hadron mass for tt^ is much less than that for the heavier p/p, and hadron mass 
effects become more important with decreasing x. In the case of the proton, the shift at small x in the cross section 
created by hadron mass effects is significantly larger than the theoretical error calculated in the standard way. It is 
also found that these effects become less important with increasing ^/s as expected. 
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FIG. 16: Comparison of the calculation using the AKK08 FF sets with measu rements of the invariant differential cross sections 
for inclusive production in e~^e~ reactions at ^/s — 14 GeV from TASSO [ll3| ] (labeled "Default"). Large x resummation, to be 
discussed in subsection 15.21 has been implemented. Also shown is the calculation when mass effects are neglected (the dotted 
curve labeled "m;, = 0") and when the ratio k = Mf /s is increased to 4 (lower solid curve) and decreased to 1/4 (upper solid 
curve). In the case of tt*, the ruh = curve cannot be seen because it overlaps with the "Default" curve. From Ref. [t^ . 
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ep reactions |96| The hadron's momentum is given by 



Ph 



so that 



or 



( 

\2xq 



X 1 



-^xq ,0 



(90) 



(91) 




(92) 



The momentum fraction z is defined by the relation fc = / z, where k is the momentum of the fragmenting partons 
at leading twist. For massless partons, A;+ = 0, and fcj- = is always true. The cross section 



da" 



dxpdxsdQ'- 



:{Xp, XB:Q^) 




da'' 



dxdxsdQ' 



■{x{Xp),XB,Q'^), 



(93) 



where again da'^ /dxdxsdQ^ is the usual explicitly hadron mass independent calculation, which is given in eq. (j59p . 
This modification to the normahzation is equal to that of Refs. |ll4| up to terms of 0{{m\l {x'^Q'^)Y). 

As Fig. llTl shows. hadron mass effects become important with decreasing x (left) and increasing hadron mass (right). 
Hadron mass effects are also found to become less important with increasing Q (right) as expected. 
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FIG. 17: Left: Comparisons of theoretical predictions using the AKK FF sets with the distribution in x (written "Sp" here) of 
unidentified hadrons from ZEUS ^9^. The modifications to the default predictions (solid line), calculated with mh = 0, arising 
from the replacement of the CTEQ6M PDF set of Ref. [H by the MRST2001 PDF set of Ref. 115], from the removal of the 
evolved gluon, and from the incorporation of the hadron mass effect are shown. Right: Comparisons of theoretical predictions 
using the AKK FF set with the ZEUS data [97|j, measured in the interval 0.1 < Xp < 0.2 and for different values of ruh- From 
Ref. il. 



Note that the effect of the initial proton's mass in HERA data is expected to approximately cancel between the 
numerator and denominator in cq. (|6ip . and thus is not accounted for. Furthermore, it is only important at large xb^ 
while according to Fig. [7] the HERA data are taken at relatively small xb values. 

Hadron-hadron reactions 7^] As for eq. ([55)1 for e+e~ reactions above, the parton's + component of momentum 
in the cm. frame is given by 



(94) 
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if the 3-axis is taken to be parallel with the parton's and hadron's spatial momenta, and 

d<^.^Y.| dzd<jl,^Dl{z,Mf). (95) 

The measured cross section -F';5'i/i2 ^'l- differential in the Lorentz invariant element d^p/E, so we do not need 

to generalize the scaling variable x in eq. (j75p to the case where ruh 0: Equation (|77p is produced by dividing the 
left hand side of eq. ([95|) by this element, which must be related to the equivalent partonic element d^k/k^ that we 
would like to appear explicitly on the right hand side. Assuming massless partons for simplicity, the relations = \k\ 
and E = \/ + mf^ imply firstly that 

dp^ 



zd\p\dQ,, 



^^k\d\k\dn, 

where Vt is the solid angle, and it has been noted that the detected hadron and fragmenting parton are spatially 
parallel, and secondly that, from eq. ([M)) and using fc+ = fc° + |fc| and p"*" = p° + 



2z\k\^\p\ + ^Jp■^ + ml. (97) 

From these last two results we obtain our desired relation 

dp^ 1 dfc3 



E z^R^ fc" ' 

where 



(98) 



R = 1 — f99) 

i\p\ + V\p\' + 0'' 



Explicitly writing the PDF dependence, eq. ([55)1 then becomes 



^^0^ = T.Jd^iJ dx,fl^{x,,Mj)f1^-{x,,Mj) J dzD^{z,Mj)\l\ 



dp z^R^ ' 



(100) 



The dependence of R in eq. (|99p on xi and X2, through \p\ in the partonic cm. frame, needs to be found now. Since 
Px in the cm. frame is the same as that in the partonic cm. frame, the definition of rapidity in eq. (j73p gives 

IpI = TOy coshy', (101) 

where 



mr = ^Pt + ml (102) 
and where the rapidity in the partonic cm. frame 

y' = y + 0. (103) 

The quantity (f> appears in the boost factor e"^ which transforms the sum of the energy and longitudinal component 
of momentum between the partonic and hadronic cm. frames. To obtain the xi and X2 dependence of 0, we work in 
the hadronic cm. frame and align the 3-axis with the beams. The + (— ) component of the parton from hi (^12) is 
kf — xiP^ (fc^ — X2P^), where is the + component of momentum of hi, equal to the — component of momentum 
of /i2. All other components of these partons' momenta vanish. The quantity k^e"^ = fc^e""^ is respectively the + 
and — component of the parton from hi and the parton from /12 in the partonic cm. frame, so that Xie'^ = X2e~'^ or, 
finally, 

= ln,/^. (104) 
V xi 

Through eq. (jl04[) and the 3 equations preceding it above, the dependence of R on xi and X2 is obtained. 

As can be seen in Fig. I18[ hadron mass effects become more important as the mass of the detected hadron increases 
and as px decrease, but remain smaller than the theoretical error calculated in the standard way. Similar results are 
obtained at higher rapidities, i.e. the size of hadron mass effects is fairly independent of rapidity. 
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FIG. 18: Comparison of the calculation using the AKK08 FF sets with measure ments of t he invariant differential cross sections 
for inclusive production in pp collisions at ^/s = 200 GeV data from STAR [llSl . Ill7| ] for which —0.5 < y < 0.5 (labeled 
"Default"). Also shown is the calculation when mass effects are neglected (the dotted curve labeled "rrih = 0") and when the 
ratio k — M'j / s is increased to 4 (lower solid curve) and decreased to 1/4 (upper sohd curve). In the case of tt*, the rrih = 
curve cannot be seen because it overlaps with the "Default" curve. From Ref. [t^ . 





2. Large x resummation 



When X is sufficiently large, the accuracy of the FO perturbative calculations for the equivalent hard partonic cross 
section and the DGLAP evolution is worsened by unresummed divergences occurring in the formal limit x — > 1. Here, 
the "large x" region refers to the region where these logarithms are important, but is otherwise not more precisely 
defined, and could even include rather small x values. The possible importance of large x resummation effects for 
available experimental data was pointed out in Ref. [121 , where uncertainties due to unresummed large x logarithms 
were minimized by optimization of the scale. Resummation of these logarithms should reduce this theoretical error. 
The effects of these unresummed logarithms should become more significant with decreasing Eg, where errors due to 
perturbative truncation are l arges t. Resummation of all large x logarithms that occur up to NLO in e^e'^ reactions 
is now possible using results [llSj which we now present. Resummation makes a significant improvement to current 
global fits [ill, which will be discussed in section [T] 

In general, in the series expansion of the hard part W{x^ Us) of some cross section, these large x divergences take 
the form a"[ln"~''(l — x)/{l — a;)] + , where r = 0, n labels the class of divergence. In Mellin space, these divergences 
therefore take the form a" In"^ N. They may be factored out, which results in the calculation of W taking the 
form 

WiN,a,) = WresiN,a,) (^X]<^fo (^)) ' (105) 

where the FO series in parenthesis on the right hand side is free of these divergences since they are all contained in 
/^es- at large TV is approximated by /^os when the divergences in W^cs are resummed, which involves writing Wj^cs 
as an exponential and expanding the exponent in keeping In N fixed. The perturbative series for the NLO quark 
coefficient function Cq = Cns = Cs for inclusive quark production in e~^e~ reactions contains a leading (class r = 0) 
and next-to-leading (class r = 1) divergence, which can be replaced in the manner of eq. pOSp by all such divergences 
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of these two classes. These divergences are all contained in the formula [H 



lnCqiN,asis))^ / dz— - 

.In t 



- 1 



where 
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0(1), 



(106) 



{A,B){a^) = J2{A,B) 



in). 



(107) 



To determine them explicitly requires the results 

^(1) =2Cf, 



A^'' =-C,[C, ('^ - f ) + f T,n, ) and 



(108) 



-c 



F- 



Since eq. (|106p is algebraically similar to the resummed quark coefficient function of DIS [ll9|, we may obtain the 
divergences of classes r = 0, 1 directly from the MS result in Ref. [ssj . 
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where A^ = a^/Joln A'^. According to the general form of eq. (jlOSp . the resulting resummed quark coefficient function 
that we seek is 



c, = c;=°-i(i + a,(cw-a;=°^i W)) 



where 



C. 



r=0,l (1) 
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(110) 



(111) 



is the coefficient of the 0{as) term in the expansion of C^^*^'! in a^, and ensures that the original NLO result is 
obtained when the whole of the right hand side of eq. (|110p is expanded in Os, i.e. when the resummation is "undone" , 
because it prevents double counting of the divergences. Note that there are an infinite number of other schemes which 
are consistent with this criteria and give the large N behaviour of eq. (|109p . a typical feature of perturbation theory. 
For example, the modification N A^ + a in eqs. (I109P and ()111|) simultaneously, where a is a finite, possibly complex, 
constant, is perfectly valid provided \a\ is not too large. 

Equation (|109p contains a Landau pole when A^ = 1, for which N is real and ^ 1. However, in the inverse Mellin 
transform it is not necessary for the contour in the complex TV plane to run to the right of this pole, as it should 
for the other poles, because it is unphysical, created by the ambiguity of the asymptotic series. In x space it gives 
essentially a higher twist contribution and is therefore negligible. The best convergence of the seri es occurs when the 
contour is chosen to be to the left of this pole, which is known as the minimal prescription '120], which is also the 
most efficient choice for the numerical evaluation of the inverse Mellin transform. 

According to Fig. [111 the effect of the resummation is to increase the cross section (at the standard choice Mf/Eg — 
1), and more so with increasing x. This behaviour is also observed in Fig. 1161 This is a typical feature of soft gluon 
resummation in both timelike and spacelike processes. Fig. 1191 also shows that resummation lowers the scale variation 
if the resummed result is compared with the unresummed result at the same values for Mf /Eg — [ijQ. From this we 
can infer that resummation lowers the theoretical error, assuming that the factorization scale in the resummed case 
can be given the same interpretation as the factorization scale in the unresummed case. However, caution is advised 
here because resummation is not a systematic improvement to a FO calculation in the same way that extending a FO 
calculation to higher orders is. 

Explicit results for the resummation of large x logarithms in ep reactions do not exist at present, but, in the current 
fragmentation region, are likely to be similar to the results for e^e~ reactions above due to the physical similarities 
between these two processes. 
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solid: resummed 
dashed; fixed order 



FIG. 19: Dependence of the unresummed ("fixed order") and resummed cross secti on o n the factorization scale Mf, written 
here as "jj." . The renormalization scale is fixed to the factorization scale. From Ref. [ll8| . 



Large x resummation is particularly valuable for pp{p) reactions where the scale variation is large, as can be seen 
in Fig. 1201 Note again the reduction in theoretical error due to the resummation. Unfortunately, in this case formal 
results only exist for the case that rapi dity i s integrated over all values, the results for a given finite rapidity range 
having to be determined approximately |12l| . 
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FIG. 20: Dependence of the unresummed ("NLO") and resummed ("NLL") cross section on the factorization scale Mf, where 
^ = Mf/pT- The renormalization scale is fixed to the factorization scale. From Ref. [l2ll ]. 



For consistency, the large x logarithms in the DGLAP evolution of the FFs must also be resummed. These 
logarithms occur in the diagonal parts of the splitting function (including the valence and non singlet). The FO result 
for P is already resummed, because these divergences approach 1/(1 — z), or In iV in Mellin space, at every order 
[8^ll22l |. This is because they originate from soft radiation which is classical in nature, and therefore can be arranged 
to appear at LO only by choosing a physical strong coupling 123]. In other words, there are no r = divergences, 
the divergence at LO, proportional to aglnN, is the only r — 1 divergence, the divergence at NLO, proportional to 

IniV, is the only r — 2 divergence, and so on. Up to NLLs, the matrix U appearing in eq. pT|) behaves at large N 
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like \124 



where a+ — E and a- 
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g, and 



E 



exp 



a, \nN + 0{as{as In TV)") 



(112) 
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A method for resumming these logarithms in the ana lytic Mellin space solution to the DGLAP equation of Refs. 
[ssl . [39} discussed in subsection 12.51 is given in Ref. jl24l | . As can be seen in Fig. [5l] the effect of this resummation at 
n/Mf = 1 is much smaller than that in the coefficient functions, although the modification to the theoretical error 
obtained by scale yariation is substantial. 
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FIG. 21: Dependence of the unresummed ("NLO") and resummed ("N LO+ res") non singlet FF on the renormalization scale 
^. The factorization scale Mf is written "Q", and k = ^/Q- From Ref. |l24l ]. 



6. THE TREATMENT OF EXPERIMENTAL ERRORS 



One of the main purposes of fitting FFs to experimental data is to make predictions for other observables that depend 
on them through the uniyersality of FFs implied by the factorization theorem. Assuming that the data of the fit 
were calculated correctly, a disagreement between the calculation of a prediction (or predictions) and its measurement 
suggests either a problem in the calculation of the prediction or that this prediction depends on components of the 
FFs which were not well constrained by the experimental data used in the fit. The latter scenario will be the case 
if the quality of the fit repeated with the inclusion of the prediction's measurement is good, but the former scenario 
must be considered if the prediction's measurement cannot be fitted well. More generally, constraints imposed on an 
as yet unmeasured prediction by a fit can be determined by using instead a fictitious measurement of the prediction 
in the fit and then repeating the fit multiple times for a range of values of this measurement, the acceptable values 
being those for which a good fit is obtained. This is precisely what is achieved by the Lagrange multiplier method, to 
be discussed at the end of subsection 16.31 A perhaps slightly less reliable but currently more practical alternative to 
constraining predictions is to fit only to experimental data for which all relevant physics can be reliably calculated, 
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determine the errors on the FFs propagated from these data, and then propagate those FF errors to the prediction. 
This method is more practical because a fit does not need to be performed each time constraints on a prediction 
are to be determined. In addition, if a prediction which has been measured is beheved to be dependent on new 
physics effects, such as higher twist or physics beyond the Standard Model, regions of the space of this new physics's 
parameters can be easily ruled out by a measurement of this prediction by calculating the prediction's propagated 
error ranges for different values of these parameters and seeing which ranges fail to overlap with the measurement 
within its error range. Otherwise, if the FF errors are not known, these regions can only be ruled out by repeating 
the fit with the prediction's measurement included for many different values for the new physics's parameters. 

In the above discussion, we have assumed that the constraints provided by the experimental data used in a fit are 
complete, in the sense that all systematic effects in the experiment are available. In practice it can happen that some 
important systematic effects are unknown, resulting in a failure to fit or to describe a measurement as discussed above 
even though all relevant physics effects are accounted for in the calculation. In this case it must be decided whether 
this measurement is consistent with other similar measurements from other experiments. 

Most recently, a metho d ha s been developed and applied by the NNPDF collaboration which uses neural networks 
as unbiased interpolants [l25j |. This method solves a number of statistical issues in global fits. 

In the rest of this section we discuss these issues in more detail. We first discuss the meaning of and the 
interpretation of the parameter values at its minimum. The current method of presenting PDF errors such that they 
can be easily propagated to predictions is then discussed, because this procedure is applicable to FFs. Finally, we 
discuss the Lagrange multiplier method already applied in FF fits, since this is perhaps the most reliable method of 
propagating errors to predictions. 

1. Relation between x'^ and probability densities 

In this subsection we define for a given set of measurements and explain why the location of its minimum in 
the space of parameters of the theory corresponds to t he m ost likely values of these parameters according to these 
measurements. Our starting point is Bayes' theorem [49lll26{ . which states that the probability, as a quantification of 
the present degree of confidence from experience, for the vector of experimental values jl to take values between 
and + dj^ and for the vector /* of "true" values /* to take values between /* and /* + dp is given by 

p{r,f)drdf = p{f\ndfp{ndr = p{nf)drp{f)df. (im) 

The probability densities PiflD, P{f*), Pif") and P{r\f) appearing in eq. will now be defined. P{f''\f)df'' 

is the probability that measurements give results between and + df'^ given "true" values /*, and its dependence 
on f is determined from experiment. The quantity x^if^^ /*) is defined through 

(xe-5x'(-^°'/*). (115) 
For example, in the case that the experimental uncertainties are purely statistical and are Gaussian distributed. 

We defer to subsection 16.21 the form of when systematic errors are present. P{p)dp is the probability that 
the true results lie between /* and /* + d/*, which reflects the knowledge of these observables prior to the f^ ± Ci 
measurements. P{f'^)df'^ is the probability that measurements will give results between and + df^ prior to the 
experiment taking place, and may be determined from -P(/'^|/*) and PiP) by performing the integration in the first 
and second equalities in eq. (|114p over /*, using J P{P\f'^)dp = 1 and dividing out the factor df^: 

fe\ ft\ r,l ft\jrt 



Pin - J p{nnp{r)df. [in) 

Finally, P{f*\f'^)df* quantifies the "degree of belief" for the true observables to take values between /* and /* + d/* 
as a result of the experimental results f^, and is the quantity through which the test theoretical calculations are 
constrained by the experimental data by identifying these calculations with the test true values /*. From eqs. (jll4p 
and (fTTTl) . 

^/ 1/ i J Pif^iDPindp- ^ ^' 
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If the experimental errors are sufficiently small, P{P) will be approximately constant in /* in the region of most 
statistical relevance, /* ± ct;, and so approximately divides out of this expression. Then also using eq. (jllSp . we have 
finally 

« e-5x'(r,/*). (119) 

Consequently, the values of /* in which we are most confident are those for which is a minimum, since then the 
likelihood P{P\f'^) is maximal. Since in general we are interested in experimental constraints of the vector a of 
parameters of a theory, whose number should be much less than the number of data points or components of /* 
in order to justify such tests, we should be considering probability densities in a instead of in /*. Any probability 
distribution p(/*) in /* and the equivalent p{a) in a are related via p{f*)df* = p{a)da, i.e. p{a) = p{f*)Jft{a), 
where J ft (a) is the Jacobian of /* in a. Again, if the experimental errors are sufficiently small such that Jft (a) is 
approximately constant in a in the region of most statistical relevance, ^(0!/*^) oc P{P\J'^) and eq. (|119p becomes 

P(a|r) cx e-5x'(rj*(«)). (120) 

Thus the minimization of with respect to a gives the most likely values of the parameters for these experimental 
data. 

If the theore tical mod el and experimental data are reliable, the expected value of the minimized is xtxi^ = 
A^DF ± \/2]Vdf [I27I Il28l |. where A'df is the number of degrees of freedom, i.e. the difference between the number of 
data points and the number of parameters being fitted. Often the value of the reduced Xmin' Xdf = Xmin/-^DF, is 
quoted since this value is always expected to be around unity, give or take y^2/N-£)p. 



2. Incorporation of systematic errors in x 

An experimental systematic effect is a physical effect arising from a single experiment-dependent source (so this 
does not include physical effects which are common to all experiments measuring similar observables), and therefore 
the modification to the central value of any data point due to this systematic effect is completely correlated with 
its modification to any other from the same experiment. Thus, the Kth source of systematic uncertainty will cause 
to be shifted to ff + XKcrf' , where \k, which is statistically distributed, measures the importance of the systematic 
effect itself and so is independent of the measurements, and the measures the influence of this systematic effect on 
each measurement. While XkO'^ is fixed, the relative normalization between Xk and the is not. It can be fixed 
by choosing the probability density in Xk to be proportional to exp[— A^/2], assuming Gaussian systematic errors. 
Therefore, the x^ for Gaussian statistical errors in eq. (|116p is modified to 

= E (:^^^^^&^) ' + Y^Xl. (121) 

For the specific theory used to calculate the data, eq. (|120p allows for an estimate of the Kth systematic effect on 
each data point, Xkct^ , knowing only the statistically acceptable maximum size of this systematic effect on each 
data point, \(jf^\, and its direction relative to the other data points (the sign of crf^), through minimization of x^ 
with respect to Xk- Doing such a minimization numerically would significantly delay termination of the minimization 
routine being used to perform the global fits. Therefore, this minimization should be done analytically [tI, [9§|: The 
most likely values of the Xk occur where 



2 



dX 



K 



= 0. (122) 



Solving these equations for the gives 



jkL 



where the covariance matrix 

a,^afS,,+J2afaf. (124) 



K 
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Substituting the best fit results for the Xk in eq. (|123p into eq. (|12ip gives the more familiar expression for when 
systematic errors are accounted for: 

= - (^"')., (/i - (125) 

In practice, not all information on the systematic effects in an experiment is available. For example, often the 
analysis provides only the "total" systematic error on a data point, equal to all the systematic errors of the data 
point from all sources added in quadrature. In this case, eq. (|116p is typically used in global fits with the statistical 
error modified to be the statistical error and this total systematic error added in quadrature. This corresponds to 
setting the off-diagonal components of the covariance matrix in eq. (|124p to zero, i.e. to neglecting correlation effects 
between data points. More serious is the presence of unknown systematic effects. Such effects have to be neglected 
even though they could be important, but as the number of data sets increases, the cancellation of these effects among 
different experiments will increase as far as the determination of the most likely value for the parameters is concerned. 
Unfortunately, these unknown systematic effects have to be accounted for in the determination of parameter errors. 
We will discuss common methods of doing this in subsection 16.31 



3. Propagation of experimental errors to fitted fragmentation functions 

For Gaussian statistical errors and no correlations between data points, eqs. pi9p and (|116p imply that the standard 
deviation error on /* is ai and is equal to the amount by which it has to increase or decrease with all other f*_^^ fixed 
in order for to increase by one from its minimum value of zero. A more realistic procedure based on this ideal 
one can be applied to propagate experimental errors to parameters a of a given model from the experimental data to 
which they have been fitted. For simplicity, instead of we work with the parameters 

?/i = flj - a°, (126) 

where — is the parameter space location of the minimum in for these data. All following expressions are to 
lowest order in the y^. The change in from its minimum value at the origin = to any small values of the iji is 
approximately 

Ax' ^Y^y^^^^y^' (127) 

where the Hessian H is given by 

Vk=0 

The experimental error on a given Oi is equal to the magnitude of yi that is required to make 

Ax' = 1 (129) 

when all other yj^i are fixed. Thus, in the absence of correlations between the yi, so that the off-diagonal elements 
of H vanish, the experimental error on yi is 1 / ^/Ha . More generally, according to eq. ()120p . H is the inverse of the 

covariance matrix for the parameters, 

(y.yj) = iH-%. (130) 

According to eq. (jl28p . this covariance matrix can be determined numerically by calculating then inverting the matrix 
of second derivatives of x' i which corresponds to the parabolic approximation of eq. (|127p . If this approximation is 
not valid, the covariance matrix can be determined by the criterion of eq. (|129p directly, as was done in Ref. [63 |. 

Ho wever, the criterion of eq. (|129p turns out to be far too idealistic for actual global fits. For example, in Ref. 
[l29j | the inconsistency between the resulting constraints on as{Mz) determined from this criter ion f rom differ ent 
data sets used in the CTEQ6 extraction of PDFs [o^ is highhghted. This is attributed in Refs. |l29l | and |l30l | to 
various sources. Firstly, the published statistical information on the experimental data may not be sufficiently reliable 
— errors, including systematic ones, may be inaccurate or not known at all, or some errors may not be Gaussian 
distributed, let alone symmetric. Thus, for example, a fit to, and extraction of parameter errors from, data from 
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one experiment in which the normalization error is underestimated may result in a good fit and apparently sensible 
results, but a similar analysis using data from two independent experiments suffering this problem will usually not. 
Secondly, the theory, including the parameterization used for the initial FFs/PDFs, may not be valid in all kinematic 
regions spanned by the data being fitted to. The simplest way to handle the r easons for the failure of the Ax^ = 1 
rule, whatever they may be, is to modify the criterion of eq. (|129[) to ^13ll . Il32l | 

Ax^=T2, (131) 

where the tolerance parameter T is a constant somewhat larger than 1. According to eq. p25p . this is equivalent to 
multiplying all errors, statistical and systematic, by a factor T. For exaniple, the authors of Ref. [t^ use ~ 10. 
This is significantly less than the choice = 100 in the CTEQ6 analysis [9^, although the number of data points in 
the latter analysis exceeds that of the former by a factor of almost 10. 

An alternative approach to eq. (|13ip to handle unknown systematic effects is to somehow take into acc ount the 
degree of incompatibility of the data with one another. This is done in the neural network approach of Ref. [l25|. 

The statistical information on parameters of a theory from experimental data can be propagated to observables as 
fo llows . Consider a prediction X which depends on the FF parameters Ui. We wish to determine the uncertainty on X, 
vAX^, resulting from the uncertainties on the experimental data which constrain the a^. Since AX = ^i{dX/dyi)yi, 

The quantity (yiyj) = J P{yk)yiyjdy. The probability distribution in y is chosen to be P{yk) oc exp[— Ax^/(2T^)], 
where Ax^ is given by eq. (|127p . which is the same as eq. (|120p except that the divisor has been introduced to 
ensure that the range of acceptable locations in parameter space are those for which Ax^ < (recall eq. p3ip ). 
This modifies the eq. (|130p . the covariance matrix of the parameters, to {yiyj) = T^(iJ^^) 



A practical formula for calculating the error on a prediction is 



/ayA Sr^ ^(^fe " K^k^)-X{yk = -i^hi) rj.2frj-u X{yk = k^j) - X{yk = -nSkj) , . 

{AX } = ^ ^ hj ^ ' (l-^S) 

where k should be chosen less than or of the order of the error on each y^, but is otherwise arbitrary. In other 
words, constraints on a prediction are obtained simply by calculating it for each of the different sets of FFs for which 
yk = inSki, (and for which yk = for the central value of the prediction). Knowledge of the fitted and their errors, 
or even the choice of parameterization, is not required. 

Equation ()133p can be simplified by diagonalization of the Hessian |l3ll . Il32l | , so that the parameters of the new 
basis are uncorrelated. The normalizations of these uncorrelated parameters Zi are chosen such that 

Ax^=^.?, (134) 

i 

which is achieved through eq. (|127p by taking 

= X] ^iVjVji, (135) 

3 

where vik is the \dh eigenvector of Hij with eigenvalue e^, 

'^H^jVjk = ikVik, (136) 
j 

and the eigenvectors have been chosen orthonormal, 

'^VijVik = Sjk, (137) 

i 

i.e. the matrix v obeys the orthonormal condition = v~^. This change in the parameter correlations due to the 
change of parameter basis is illustrated in Fig. [221 Equation (|133p is simplified to 

{AX') = l^J2 (^iVki^i = t^i^)) - ^iVki^i = . (138) 
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2-dim rendition of d- dim (-16) PDF parameter space 

contours of constant global 
U/i eigenvector in the l-direction 




Original parameter basis Orthonormal eigenvector basis 

FIG. 22: Illustration of the Hessian diagonalization of Refs. |l3l|, |l32|| for the case of 2 parameters for simplicity, showing 
contours of constant in parameter space. In the new space of uncorrelated parameters (right), the contours are circles. From 
Ref. (l33] . 

where the choice t < T should be made. Thus, errors on predictions can be s imply calculat ed from two sets 5*^^ for 
each i of FFs at parameter values yk = yk{zi — ±tSii), as advocated in Refs. jl27l Il3ll Il32| . and is the current way 
in which PDFs are presented. 

The Lagrange multiplier method 13l|, implemented in the global fit of Ref. f6d\, is a more reliable approach to 
obtaining errors on predictions because it avoids the assumption that has a quadratic dependence on the parameters, 
as assumed in eq. (I127p . The price is that it is computationally slower. This method gives the range of values of 
a prediction X such that the minimum in in which the fitted parameters are constrained such that X is in this 
range is no greater than an amount above the unconstrained minimum x^- This is achieved by minimization of 
the function 

F = + AX (139) 

for a range of different values of A, because the corresponding x'^W each case is the minimum given that the 
prediction takes the value X{X). To see this, take dF = and choose the dui such that X{X) is fixed, i.e. such that 
J2i(dX / dai)dai = 0. Using this same reasoning, we see that eq. (I139p . which is the one used in practice, leads to the 
same results as the alternative form 

F-x'+[^^)\ (140) 

with ax and Xq varied, and therefore the Lagrange multiplier method is equivalent to the method of fictitious 
measurements that was discussed in the introduction to this section. 

Figure [27| shows some results obtained with the Lagrange multiplier method, and will be discussed in subsection 

7. CURRENT GLOBAL FITS 

Recent global fits [g^, [71|, [t^, [t^ have made use of various experimental, computational, statistical and theoretical 
advances. 

Data from e~^e~ reactions have provided accurate constraints on charged-sign unidentified quark FFs. pp reaction 
data from RHIC provides better and NLO constraints on the gluon and, along with HERMES data from HERA, is the 
only data which can constrain the charge-sign asymmetry FFs. However, due to the high accuracy and number of the 
e'^e~ reaction data relative to other data, charge-sign asymmetry FFs remain much less constrained than charged-sign 

unidentified FFs. Note that an FF for a particle of a given charge-sign carry a 

large error coming from the charge-sign asymmetry FF Df^"^ . However, this problem can be avoided. As we saw in 
subsection l2.6i the evolution of charge-sign asymmetry FFs is independent of the evolution of charge-sign unidentified 
FFs. Furthermore, observables in general can be combined such that they either depend on charge-sign asymmetry 
FFs or charge-sign unidentified FFs, but not both, and we saw this for the case of e+e~ reactions in subsection 12.91 



47 



and in the case of pp reactions in subsection 14.21 Consequently, fits of charge-sign unidentified FFs and charge-sign 
asymmetry FFs can be performed independently of one another. 

Due to the large amount of data now available for constraining FFs, it is important that cross sections can be 
calculated quickly to ensure that the minimization of occurs in a reasonable time. Calculating da^ via the inverse 
Mellin transform is faster than via the convolution in eq. ([T]) since the evolved FFs in Mellin space are obtained much 
more quickly than the evolved FFs in z space which are obtained via eq. Although the calculations of the da^ 
in Mellin space can be very time consuming, especially if no analytic results exist so that the integration in eq. (jC7[) 
must be performed numerically, they are usually fixed during the fit and therefore they need only be performed prior 
to fitting, at point s in complex Mellin space which serve as the supports for the integration in the inverse Mellin 
transform [7ll . ll33j |. The same also applies for the evolution E in eq. Thus only the initial FFs, which vary during 
the fit, need to be calculated each time a cross section is c alculated. This precalculation is also suitable for adaptive 
integrations such as that used in the function GAUSS '1345 in the CERNLIB package, because, for any given number 
of points to be used for the integration, each and every possible positioning of these points is predictable, the number 
of positionings being finite in number, and the values of the rfa' (multiplied by E) at each point need to be calculated 
only once as and when they are needed during the fitting, and then stored. Most modern computers can manage 
the large amount of computer memory required for this precalculation for adaptive integrations. Although adaptive 
integration routines can take longer than integration routines which use the same points each time, they are more 
reliable. In the case that quantities such as as{Mz) and hadron mass are also fitted, the da^ in Mellin space will 
not themselves be fixed during the fit. Some modification to the simple precalculation to handle such scenarios will 
usually be possible. Finally, we note here that it is not necessary to limit the contour of integration in Mellin space to 
be of finite length, because the infinite contour can be mapped to a finite range by a change of integration variable. 

Fitting the FJ^_^f^^, with some modification to account for logarithmic singularities, to polynomials whose Mellin 
transform can be obtained analytically avoids the use of large amounts of computer memory, but still allows cross 
sections to be calculated sufficiently quickly for global fits to terminate in a reasonable amount of time. The use 
of Chebyshev polynomials of the first kind [l35| proves to be the best choice for speed and accuracy [t^I- After 
performing the analytic Mellin transform of the Chebyshev expansions, eq. (|77p may be evaluated in Mellin space 
(there are no delta functions in F^^^^ (x, y, s/Af|, as(M|)) because these are integrated out in the convolution with 
PDFs in eq. (|79)) V This leads to an accuracy of a few parts per mil. 

As well as these new experimental results and computational methods, the latest global fits of Refs. [60l. FtH. [t^. . 
discussed in subsections 17.11 — AKK08, have made use of the theoretical methods discussed in section [5] and the 
statistical methods discussed in section [51 We compare some of the results of these fits with one another in subsection 

m 



HKNS 



In Ref. [t^, a fit of FFs for tt^, and p/p to all available e+e reaction data, excluding particle unidentified 
data because it may be contaminated with other particles beyond those just mentioned, and also excluding the OPAL 



tagging probabilities of Ref. [58|, was performed and the Hessian matrix of FF parameter errors was calculated. We 
shall refer to these as the HKNS FF sets. The results with errors for the FFs for tt^ are shown in Fig. [23l The plots 
on the left show a reduction in experimental errors propagated to FFs on going from LO to NLO. This is particularly 
significant in the case of the gluon FF, probably because at LO the gluon only contributes through the evolution and 
therefore the cross section's dependence on it is less than at NLO. Furthermore, as can be seen by comparing these 
plots with the plots on the right, FF errors are relatively higher at lower factorization scales, and therefore FF errors 
may be particularly important in pp reaction data. The plots on the right suggest that the AKK, KKP and Kretzer 
FF sets are generally consistent with the e+e" reaction data used in the HKNS analysis. 



2. DSS 

Global fits o f FF s for tt^ and were also performed in Ref. 'ti'I , including also ep reaction data from the HERMES 
collaboration [Hil at HERA and pp data from the BRAHMS, PHENIX and STAR collaborations at RHIC. pp{p) 
reaction data for tt^ and is summarized in Tables |8] and [9] respectively. We shall refer to these as the DSS FF 
sets. Systematic errors due to normalization uncertainties on the data were accounted for. Although the HERMES 
data are measured for Q < 2 GeV, where low Q effects may be important, a good fit is obtained as can be seen in 
Fig. [24] for the case of tt^. Since the charge-sign of the particles is measured, these data also provide much needed 
constraints on the valence quark FFs, or the charge-sign asymmetry FFs which are the same functions. The difference 
between the calculations for the BRAHMS data using the DSS and Kretzer FF sets seen in Fig. [25] indicates that pp 
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FIG. 23; Errors on the fitted HKNS FFs for tt propagated from the errors on the data. Left: The results from a LO and 
NLO fit. Right: Comparison of the NLO HKNS FFs, with errors, with other FF sets. From Ref. 

reaction data provide constraints not provided by e+e^ reaction data, which were the only type of data used in the 
Kretzer analysis. Note, however, that the calculation using the Kretzer FF set is within the theoretical error, which 
is large compared to the theoretical error on calculations for e'^e~ reactions illustrated in Fig. [26l The method of 
Lagrange multipliers discussed in subsection 16 . 31 to determine errors on predictions was applied in this analysis using 
the FFs at Mf = 5 GeV integrated from z = 0.2 to 1 as example predictions (for quark FFs, these are equal to the 
LO calculations of the OPAL tagging probabilities). The result of this study in the case of 7r+ is shown in Fig. [27l 
77j^g has the smallest error, equal to 3% if the tolerance parameter appearing in eq. (|13ip is chosen to be T = vTS. 
This is because all observables in the fit have a strong dependence on this FF. The error on yy^^, 5%, is not much 
larger. This quantity is expected to be well constrained by ep and pp reaction data due to the large PDFs for u at 
low X and for u at high x. Presumably because of the use of pp reaction data, the error on rj^ relative to that on the 

other T]f^ shows that the gluon FF is reasonably well constrained. Presumably because b quark tagged observables 
can be measured more accurately than c quark tagged ones, V^^i has a lower error than 

A global fit of FFs for p/p (and, separately, for unidentified particles) was performed in Ref. [605 by the same 
collaboration, including pp data from STAR. pp{p) reaction data for p/p is summarized in Table [TOl BRAHMS data 
was excluded due to possible contamination of the sample by the beam due to the large rapidity. 

3. AKK08 

A global fit of FFs for tt^,K^, p/p, and A/ A was performed i n. Re f. [t^ , including pp reaction data from RHIC 
and pp reaction data from the CDF collaboration at the Tevatron jl40l |. We shall refer to these as the AKK08 FF 
sets. As illustrated in Fig. [18] for the case of STAR data, the description of the RHIC data was good except in the 
case of the production of A/ A. Normalization errors were accounted for as systematic errors in a covariance matrix 
according to the procedure discussed in subsection 16.21 The \k at the location of the minimum value of were 
determined according to eq. (|123p . and were typically found to be in the expected range \\k\ ^ 2. As for the DSS 
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FIG. 24: Comparison of the calculations using the NLO DSS and Kretzer (labeled "KRE") FF sets of the multiplicities for the 
production of positively (open circles) and negatively (full) tt* measured by HERMES. The shaded bands in the plots on the 
right indicate estimates of theoretical uncertainties due to finite bin-size effects. From Ref. |71J. 
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FIG. 25: As in Fig. 1241 but for the BRAHMS data. The shaded bands in the upper panels indicate theoretical uncertainties, 
and are obtained by calculating in the range 1/2 < Mf/pr < 2, with ^ = Mf. From Ref. [Tlj . 



and HKNS fits, e^e reaction data for x values as low as 0.05 and for y/s < Mz was included with the LEP data 
at -^s = Mz, in contrast to the previous AKK fits [i^, [T^I where a lower cutoff a; > 0.1 was imposed and only LEP 
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X 

FIG. 26: NLO calculations usins the KKP FF set with Mf/y/s = 1/2 (dashed lines), Mf/y/s = 1 (solid lines) and 2 (dot-dashed 
lines) of data from DELP HI [l37t and OPAL [l3i,[i33 at energies ^ = 133, 161, 172, 183 and 189 GeV (from bottom to top 
in this order). From Ref. jlOGl ). 




data at y/s — Mz and TPC data at y/s — 29 GeV was used. Because of this additional lower y/s and smaller x 
data, the effect of hadron mass discussed in subsection 15.11 was incorporated into the calculations of both the pp(j)) 
and the e~^e~ reaction data. The hadron mass appearing in the calculation of the e~^e~ reaction data was fitted, and 
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in this way absorbed (approximately) any other small x, low y^s effects such as higher twist. The results are shown 
in Table [H The results for tt^, p/p and A/A suggest that hadron mass effects are the most important small x, 



TABLE 1: Fitted particle masses used in the calculation of the hadron production from e^e reactions in the AKK08 fit. For 
comparison, the true particle masses are also shown. From Ref. [72]. 



Particle 


Fitted mass (MeV) 


True mass (MeV) 


^± 


154.6 


139.6 


if± 


337.0 


493.7 


p/p 


948.8 


938.3 


K% 


343.0 


497.6 


A/ A 


1127.0 


1115.7 



low ^/s effects for the data considered. This is also consistent with the expectation that the contributions at higher 
twist fall off like 0(1/^^) [l4li, in the sense that if they fell less fast, e.g. like 0{1/Q), they would be expected to 
dominate over hadron mass effects, which fall like 0{1/Q^). The slight excess in the fitted masses is expected because 
the overall production does not arise solely from direct partonic fragmentation but also includes contributions from 
decays of heavier particles. A full error analysis is required in order to determine whether the excess seen in the fitted 
masses is significant. The large undershoot in the fitted masses of and K'^ is very likely due to physics effects not 
present in the production of tt^, p/p and A/ A, the most likely effect being the complicated production mechanisms of 
kaons from decays of heavier hadrons. Note that the undershoots in the fitted masses of and K"s, 156.7 and 154.6 
MeV respectively, are similar, which may be explained by similar production mechanisms, however complicated, for 
these two particles, as expected from SU(2) isospin symmetry. As discussed in section [3l this result suggests that the 
argument in Ref. [85| is unaffected by such production mechanisms. Again, a full error analysis is required here. 

For the reasons given at the beginning of this section, charge-sign unidentified and charged-sign asymmetry FFs 
were parameterized and fitted separately. The strongest non perturbative assumptions used in the AKK08 analysis 
is the parameterization of eq. ([56|) . taking fi{z) = 1 + Ci(l — x)'^' for charge-sign unidentified FFs and fi{z) = 1 for 
the less well constrained charge-sign asymmetry FFs. No further constraints among FFs were imposed other than 

eq. mi), which is exact in QCD, and the SU(2) isospin symmetry conditions Du ^^"'^ {z,M'j) = ^^"'^ {z.M'j) 
which are also exact in QCD in the limit that the difference between u and d quark masses vanish. This is in contrast 
to the DSS and HKNS analyses, where well justified assumptions were also imposed, which provide additional non 
phenomenological constraints on FFs. 

TABLE 2: The minimized values in each of the charge-sign unidentified AKK08 fits. For comparison, the values for the 
unresummed fit are shown (under "Unres. fit"). From Ref. [73] . 



H 




Main fit 


Unres. fit 


^± 


518.7 


519.0 




416.6 


439.4 


p/p 


525.2 


538.0 


K% 


317.2 


318.7 


A/ A 


273.1 


325.7 



The large x resummation discussed in subsection 15.21 was incorporated into the calculations of the e+e" reaction 
data in the charge-sign unidentified fits, and makes a significant improvement in the case of K ^, p/ p and A/ A as 
Table [2] shows, and does not worsen the fits for tt^ and Kg. Further large x data from e.g. BaBar |l42l | would further 
ascertain whether large x resummation improves the description of e~^e~ reaction data. In this case it would be 
interesting to determine whether it is in fact necessary either to implement or to not implement large x resummation. 
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4. Comparisons of the different FF sets 



H=K~, i=u, M=9\.2 GeV 



H=fr,i=c. M =91.2 GeV 
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H=K~, i=d, M=9\.2 GeV 



H=K~,i=b.M=9\.2 GeV 
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H=]C,i=s.M=9\.2 GeV 



H=K~,i=g.M=9\.2 GeV 
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FIG. 28: The FFs for at M/ = 91.2 GeV. 



As an example of how the different FF sets compare, we show the FFs for from the different sets in Fig. [551 
These represent an "average" comparison, i.e. better agreement is found among the different FF sets for tt^ and worse 
for p/pSimilar results are obtained at intermediate and large z for the u quark (except for DSS, and for AKK at large 
z) which is favoured, and for the c and b quarks away from large z which is well constrained by c and b quark tagged 
measurements. For the d quark, which is unfavoured, the results are all very different. The large differences between 
the DSS and AKK08 gluon FFs suggests the constraints on gluon fragmentation provided by pp reaction data are not 
as significant as might be hoped. 

In Figs. [2911301 andlSTl we show the charge-sign asymmetry FF sets for tt*, and p/p respectively. The HKNS and 
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FIG. 29: The quark FFs for Ac7r=^ at M/ = 91.2 GeV. 
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FIG. 30: The u quark FF for l^^K^ at My = 91.2 GeV. 



DSS results are generally rather similar. However, no phenomenological constraints on the charge-sign asymmetry FFs 
were applied in the HKNS analyses, so this similarity is most likely due to the similar non perturbative assumptions 

used in the HKNS and DSS analyses, indicating that most of the constraint on the charge-sign asymmetry FFs in these 
two analyses may come from these assumptions. These results show that the charge-sign asymmetry FFs, which help 
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FIG. 31: The quark FFs for A^p/p at M/ = 91.2 GeV. 



to extract important information on nucleon PDFs 0j S @| as mentioned in the introduction, are poorly constrained, 
and highlight the need for hadron species and charge-sign identified data from e.g. HERA and further such data from 
RHIC, as well as measurements of the asymmetric cross section in e+e" reactions discussed in subsection [ 



8. IMPROVING THE CALCULATION AT SMALL x 



Perhaps the most serious limitation of the standard FO approach is its inability to describe a wealth of small x data 
from e^e~ reactions, because unresummed soft gluon logarithms (SGLs) in both the evolution and the coefficient 
functions spoil the convergence of the perturbation series. This problem even renders some well measured observables 
singular, such as the multiplicity of hadrons of species h, being the average total number of these hadrons produced 
in a single event. Using eq. (|38p. the multiplicity is obtained from the cross section according to 



/■I 1 da^ 
/ dx—-^[x,s), 
Jo ct{s) dx 



(141) 



This can be interpreted in a physically meaningful sense as the number of hadrons produced by fragmentation of 
quark J, because, at LO, 



/ dx 



Qq.i 



(142) 



and because to all orders it is scheme and scale independent. The total multiplicity is of course given by (n'*(s)) 

E.7«(^)>- 
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Resummation of SGLs within this FO framework would make a major improvement to the extraction of FFs, 
because it would allow small x data to constrain FFs at small z for the first time. Furthermore, the admission of 
these new data at small x imposes further constraints on FFs at large z because of the convolution over a; < z < 1 in 
eq. H]). As we will see, this resummation also permits a calculation of the multiplicity. 

In the DGLAP evolution, the largest of these SGLs, the double logarithms (DLs), can be calculated to all orders 
and then rcsummcd by using the results of the double logarithmic approximation (DLA). Since the only SGLs that 
appear at LO are DLs, a LO evolution of FFs that includes small z is now possible. However, at present the full DL 
contribution to the gluon coefficient function for e^e~ reactions is unknown, and is necessary for the calculation of 
the complete DL resummed cross section. In a LO calculation, it may be a reasonable approximation to neglect this 
DL contribution because the gluon contribution vanishes at this order. A consistent NLO fit including small x data is 
not possible at present because the next largest classes of SGLs after the DLs, which in order are the single logarithms 
(SLs), the sub-single logarithms (sSLs) and the sub-sub-single logarithms (ssSLs), none of which arc known in full, 
are all present at this order. 

In addition to these shortcomings, the study of inclusive single hadron production data at small x is usually studied 
within a different frame work t o the standard FO approach, namely the framework of the modified leading logarithmic 
approximation (MLLA) |l43l Il44l [l45„ 146] , in which both DLs and the most important contributions from the 
SGL-free part of evolution are accounted for, which leads to a good description of the data over the whole x range 
currently measured. In addition, in practical applications the non perturbativc components, the FFs, are usually 
fixed, up to a n overall normalization, by the assumptions of local parton-hadron duality (LPHD) and the limiting 
spectrum |l47l |. rather than being fitted to experimental data. Perhaps the reason for this is that, until recently, the 
way in which SGLs can be incorporated into the FO evolution was not known. A procedure for resumming SGLs 
in the evolution within the FO framework now exists: As for coefficient functions, for every class of SGLs in the 
FO splitting functions, the remaining SGLs to all orders are added. This approach reduces to the MLLA in certain 
approximations, but it provides more than a mere alternative to the MLLA because it is in principle complete: it 
contains the quark contribution to the evolution, and it can be systematically improved. However, the simplicity of 
the MLLA still makes the MLLA an attractive alternative for incorporating certain types of phenomenology such as 
medium effects or double hadron production at small x. 

In this section, we discuss the present state-of-the-art in the description of small x data from e'^e" reactions. We 
first define, in subsection 18.11 the SGLs appearing in FO calculations, and then, in subsection 18.21 derive the form 
taken by SGLs belonging to a specific class when they are summed to all orders within the FO framework. We discuss 
the DLA in subsection 18.31 which is crucial for determining the DLs to all orders in the splitting functions in the FO 
framework. By direct application of the DGLAP equation, we then generalize the DLA and the FO approximations 
to a single approach in subsection 18.41 This approach incorporates both approximations, and is the lowest order 
approximation the approach developed in subsection 18.21 It can be systematically improved both in the FO part 
and in the SGL part. We show in subsection 18.51 that this approach reduces to the MLLA in certain limits used in 
the MLLA framework. Some discussion is then given on LPHD and the limiting spectrum which are used to fix the 
remaining, non perturbativc, degrees of freedom in the MLLA, Using the approach of SGL resummation within the 
FO framework, we are then able to give a perturbativc argument for the results of the LPHD and limiting spectrum 
in subsection 18 . 61 bv studying the moments of the evolution. 



1. The behaviour of the evolution at small z 



The approximation for P in eq. ([3]) fails at small z due to the presence of terms which, in the limit z — > 0, behave 
like (a^/z) In^"-"-! z for m ^ 1, 2n - 1. Such logarithms are called SGLs, and m labels their class. The class of 
logarithms for which m = 1 are the largest logarithms, known as DLs, while the next largest, for which m = 2, are 
called SLs, the next largest sSLs, the next largest ssSLs, and so on. As z decreases, these unresummed SGLs will spoil 
the convergence of the FO series for P{z,as) once ln(l/z) > 0(a7^^^). Consequently, the evolution of the Di{z,Mj) 
will not be valid at such small values of z, since the whole range z < z' < 1 contributes in eq. Therefore, according 
to eq. ID) , the FO approach is only a good approximation for cross sections whose x value, being the minimum value 

— 1/2 

of z, is sufficiently large, specifically it must obey ln(l/a:;) < 0{as ). 
Using the result 

1 1 In^-ii , , 

dzz" ^ (143) 



Pl Jo 

for Re(a') > and the integer p > 1, we find that in Mellin space these SGLs behave like a"/uj'^"~"^ where uj = N — 1, 
i.e. they become singular as w — > 0. Thus the standard FO approach in Mellin space will not be a valid approximation 
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once |a;| < 0{as ). Sometimes, those Mellin space terms for which m = 2n, i.e. which behave hke a" at small cu, 
are often included in the definition of SGLs. For example, in the MLLA, the m — 2n — 2 term in P{z, a^) at LO is 
referred to as an SL. Since such terms behave like a"(5(l — z) in z space, i.e. do not affect the accuracy of the FO 
expansion at low z and so do not need to be resummed, we will not include such terms in our definition of SGLs, 
which will always be restricted to m < 2n — 1. 
By writing eq. (1142^ as 

it is clear then that the multiplicity is undefined in the FO calculation because of the unrcsummed SGLs in the 
splitting functions. (More precisely, the multiplicity at any value of s cannot be determined from the multiplicity at 
a given value of s because its s dependence in the FO approach is singular if the SGLs are unresummed.) Beyond 
LO, unresummed SGLs in the coefficient functions will also give singular contributions to eq. (jl44p . 



2. A unified formalism for small and large z evolution 

The correct way to approximate P(uj,ag) such that both the large and small \uj\ regions (and he nce the la rge and 
small z regions) of the evolution can be described by the DGLAP equation is to put it in the form [ll2l Il48l | 

P{uj, a,) = P^{uj, as) + pS°L(a;, a,), (145) 

where is equal to P after all SGLs have been subtracted, so that it can be approximated in Mellin space for 

both large and small as the FO series 

oc 

P^(a;,a,) = ^a^P^("-i)(c.), (146) 

ri=l 

truncated at some finite value for n, similar to eq. ([3]) (note that P^^^"^^) (oi) cx hiuj at large w, as discussed in 
subsection I5.2p . and where P^^il contains only and all the SGLs in P, and which therefore can be approximated in 
Mellin space by the series 

■m—1 

truncated at some finite value for m. For a given value of m, (^)™ffm (^) in eq. (I147P is obtained by resummation 
of all class m SGLs in P. Equation p47p is just the general result of expanding a function of and in a^/oj keeping 
asjuP' fixed (although it is not completely general because the series starts at m = 1). Equation (|147p should be a 

1 /2 

good asymptotic approximation in the region \lo\ — 0{as ), at least for sufficiently small a^. In fact, from incomplete 
calculations up to the class m = 2 jl43l | , in particular the splitting function for the MLLA evolution of the gluon FF 

in eq. (|177p below, P^'^^ approximated as in eq. (|147p is believed to be valid for the whole region \uj\ < O(aV^) as 
well. In particular, at w = it is expected to be a series in y/a^ with finite coefficients, beginning at 0{y/a^). Note 
that eq. (|145p falls to zero as \uj\ oo, because in this limit P (and therefore P^^iL-j jg ^^y^ approximated by an 
expansion in as keeping uj fixed, wherein the SGLs vanish. This condition is met by eq. (|177p . 
From eq. (|143p . the inverse Mellin transform of eq. (|147p gives the z space result 

1 ^1 °c 

pSGL(^^ a,) = 1^ /™ ("'^ ' (148) 

m—1 

which can also be obtained by summing the SGLs in z space for each m. Equation (|148p shows that the approximation 
of eq. (|147p is valid in the evolution of D{z,Q^) in the whole region ln(l/z) < 0{as ^^^), which includes the small, 

— 1/2 

but not arbitrarily small, z region ln(l/z) = 0{as ). Note that the usual condition ag 1 must still hold in order 
that the approximation in eq. (|146p holds everywhere in this range of z. If P^*^'^(a;, Os) approximated as in eq. (|147p 
really is finite at w = 0, then P^^'"(z, Gs) in eq. (|148p must obey zP^^^(z, a^) — s- as z ^ 0. 

Using the DLA, it is in fact possible to calculate the m = 1 term in eq. (|147p or equivalently in eq. (|148p . i.e. the 
complete DL contribution. We show how to do this in subsection 18.31 
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3. The double logarithmic approximation 



The DLA 11491 1 states that, if the evolution is written in the form 
d 



D{z,M 



f) 



a4Mf)D{^,M^)] + l'^Piz'MMf))D{^,Mj) (149) 



where {2Ca/z)A = pDL(o)(-^-) jg 

the LO term in the expansion in of the full DL contribution to P, P , given by 
the 171 = 1 term in eq. (|148p . then P{z, Gg) is free of DLs. A will be given explicitly in the basis of singlet, non singlet 
and valence quark FFs just now, when we will see that A is a projection operator, its normalization being chosen such 
that it obeys = A, and this property will make resummation calculations to appear later in this section easier. 
Equation (fUg]) can be obtained from the generating functional technique [143] in the context of angular ordering 
(l50| | of successive emissions of gluons in the process e'^e~ ^ q + q + Ng, where Ng refers to N gluons in the final 
state. The quantity P is completely constrained by the DGLAP equation. It can be obtained in terms of P order by 
order in a^, by expanding the operator in eq. (I149P in the form 



d In Mi 

z f 



exp 



21nz' 



d\nM] 



E 



(21nz')" 



din 1/2 



and then repeatedly applying the evolution equations, Eqs. ([2|) and ([4]), to the 
ations in eq. (|149p . For example, to 0{al) (NLO) one finds that 



dh-iMj 



P[z,a,) = P{z,a,)-2CAA 



a , , , In z 

— + 2(3{as) 

z z 



^ dz' 2a,lnz' 



-P 



(150) 



oper- 



(151) 



In the square brackets on the RHS of eq. (I15ip . only the first term contributes to the 0{as) (LO) part of P, while the 
second and third term contribute to the 0{a?g) part. To this accuracy, the third term is calculated with P{z,as) = 
agP'-^^z) (see appendix [Bl for the explicit functions). From eq. (|15ip . we observe that P to NLO is free of DLs, by 
taking the DLs in P{z, Ug) for the first term on the RHS of eq. (|15ip to NLO, 



P^^(z,a,) 



or ^ 

ZL^A — O's 

z 



AC 



AW z 



Oial 



(152) 



In eq. (|149p . A is zero whenever is a non singlet FF or composed of valence quark FF. In this case, eq. ^ implies 
P = P, i.e. the splitting functions for such FFs are free of DLs. For D = (-Ds, Dg), 



A = 



2Cf 

Ca 

1 



(153) 



Note in general that A^ = A, as mentioned earlier. 

For consistency, resummation of the DLs in the hard partonic cross sections (i.e. the coefficient functio ns) o f the 
process under consideration is also necessary. In principle, this has been done for e^e~ reactions in Ref. [15ll |. the 
result being that Cns and Cps are free of DLs while 



Cg{uj,as) 

The expansion up to NNLO of this result, 

Cg{uj,as) = 



2Gf 

"c7 



2Cf 

Ca 



lid — \/uo'^ + IQCao-s 
2 + IQCjal 



48Cl 



may be compared with the small lo limit of the result in Ref. ([152]) 

2Cf 



Cg{uj,as) 



Ca 



40Ci ^ 



(154) 



(155) 



(156) 



There is clearly agreement at NLO, but not at NNLO. The most likely reason is the difference in the choice of scheme, 
or rather, since the DLs should be scheme independent, an lo dependent difference in the choice of normalization of Cg 



58 



between the two approaches. Further studies of the factorization procedure used in Ref. [151l | are needed here. In any 
case, the coefficient functions for e"'"e~ reactions at LO are free of DLs and so it may be a reasonable approximation 
to neglect them altogether. This is done in the MLLA, to be discussed in subsection 18.51 

The DLA implies a relation among quark and gluon FFs which simplifies calculations of cross sections. According 
to eq. (|149p with P neglected, which is a reasonable approximation at small z since it is free of DLs, the ratio of 
dD^/dlnMj to dDg/d\nMj is 2Cf/Ca, while the operation of d/dhiM'j on the non singlet and valence quark FFs 
gives zero in this approximation. Integrating these results over In Af| and neglecting the constants of integration 
relative to the FFs gives 

D,., = D-g,, = ^Dg, (157) 

reducing the number of FFs required for the cross section to just one, Dg. Although we will not use cq. (|157p in this 
subsection, we will consider an application of it in subsection 18.41 

Equation p49p is very difficult to solve numerically. An alternative approach [ll2l Il48l | which is appropriate for 
numerical work is to use eq. (|149[) to obtain P^^ and then to solve eq. ([2]) with P given by eq. (|145p . P^^ is obtained 
by substituting eq. ([8]) into the Mellin space transform of eq. p49|) : 

+ '^,^^ D ^2CAasAD+ | ^ + 2 "^^^^ ]pD, (158) 

I rflnM^ I dliiMj \ d\nM^ I ^ ^ 

where for brevity we have written D — D{lo, Mj), ag — as(M| ) and P = P{u!, a^). The result is 



^dl-nMj 



{P -~P) +2{P -~P) P -2CAasA = Q, (159) 



where again for brevity we have written P = P{u!,as). As discussed at the end of subsection 12. 5( eq. ([8]) with P 
unspecified is completely general, so that at this point eq. p59p is simply an alternative way of writing cq. (I158p . In 
particular, P{u!,as) in eq. (|159p must not be seen as an expansion in ag. To obtain P^^, we split P into its DL and 
non DL (P^) parts, 

P^P^ + pDL^ (160) 

then expand eq. (|159p as a series in as/cu keeping ag/uj^ fixed (as in eq. p47p . noting that P^ and P are of 0((as/w)^), 
that ui = {as/ijj){as/uP')~^ = 0{as/uj) and that = (as/u;)^(as/w^)^"'^ — 0((as/u;)^)) and finally extract the first, 
0{{as/u!)'^), term to find that the constraint on P^^ is exactly 

2(pDL)2 ^ ^pDL _ 2CAasA = 0. (161) 

Equation (jl6ip gives two solutions for each component of P. Since P is never larger than a 2 x 2 matrix in the basis 
consisting of singlet, gluon, non-singlet and valence quark FFs, there are four solutions. The only solution which can 
be expanded in and which reproduces the DLs in P at LO and NLO, given by 

/ a n _ „2 16CfCa \ 

^°"(-.«^) : ' -2C. +0{a% (162) 

is 

A 



P°L(w, as) = -(^-LU + v/w2 + WCAas) ■ (163) 



Equation (jl63p agrees with the results of Ref. [l5l| , which are derived using the conventional renormalization group 
approach, and with the results from the generating functional technique of Ref. 143). 

4. Incorporating the DLA into the DGLAP equation 

To a first approximation, the evolution valid for all ln(l/z) < 0{as ) takes the form of eq. ([2]) with 

P(z,a,) =a,P^(")(z) + pi^L(z,a,). (164) 
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Here, P^^ in z space (the m = 1 term in eq. (|148p ) is given by 

P^^iz, a,) = (4Vc^s\n , (165) 

z In - V z J 

where Ji(y) is the Bessel function of the first kind, which can be calculated from the result 

Ji{y) = - / ^6* cos(y sin 61 - 6). (166) 
Jo 

In Fig. [221 we see that Pgg{z,as) calculated as in eq. (|164p interpolates well between its 0{as) approximation in the 

FO approach, Pgg\ at large z and P^^{z,as) at small z (the small difference here comes from P^^(o)(z) at small 
z). DL resummation clearly makes a large difference to P at small z. 

5 I , , , ^ , , 1 
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FIG. 32: (i) Pgg{z,as) calculated as in eq. (IT64)l (labeled "DL+L0(+L05)"), (ii) Pgg{z,as) calculated to 0{as) in the FO 
approach (labeled "LO"), and (iii) P°^iz,a^) (labeled "DL"). as = 0.118/(27r) and "x" is z. From Ref. [TT^ . 

Higher order calculations in the FO approach should extend the validity of the FO calculation of the evolution to 
smaller z values, but not by as much as the result in which all SGLs up to a given class are included because, for 
sufficiently small z, the order required for the evolution in the FO approach to be accurate will become higher than 
the order available. We can use eq. (|165p to illustrate this point. Using the expansion 

the series for P^^{z,as) in reads 

p-(z, a.) = ± H77TTV 

The series in eq. (|168p may also be obtained by expanding eq. (|163p to infinite order in a^. For z > 0, eq. I|168p 
converges rapidly at sufficiently large r, but the value of r at which this convergence sets in increases with decreasing 
z. For example, for z — 0.01, the accuracy of the series reaches the level of a few percent only at r = 7. In any case, 
higher order calculations in the FO approach should at least extend the validity of the FO calculation of the evolution 
to lower z values. 

By using eq. (I163p for P in the DGLAP equation, eq. ([5]), the dependence of the multiplicity in eq. (|144p on s is 
found to be 

(n'*(s)) cx exp 



2VCJ 1 



/3o ^/aJ7) 



(169) 
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i.e. the calculation of the s dependence of the multiplicity is finite once the DLs have been resummed. Of course, the 
DL contribution from the gluon coefficient function has been neglected, but this is probably negligible relative to the 
LO quark coefficient function. 

We now discuss how eq. (|164l) was used in the phenomcnological studies in Ref. jll2j |. From eqs. (P7)) (with tagged 
quarks qj summed over) and ([29]) . the LO cross section for e+e~ reactions reads 

1 Hrr^ 1 

where {Q{s)) — X^jLi Qqji^) is the average electroweak coupling of quarks. Using eq. (|170p . fits to e+e" reaction 
data have been performed in Ref. jll2l |. The initial FFs are parameterized as 

D^{z,M^) = A^exp[-cln2z]z"(l - zf, (171) 

because at large z its behaviour is that in eq. ((56|) (with fi — 1), which has so far sufficed for global fits to large x 
data, while at small z eq. (|17ip reduces, for c > 0, to a Gaussian in In 1/z, 



— cln am — 

z z 



(172) 



which is the empirical behaviour of the cross section at small x — z. As will be shown at the end of subsection [ 
eq. (|172p is also the large Mq behaviour of the FFs predicted by the DLA / MLLA, with a < to ensure the center 
in Inl/x is at —a/ {2c) > 0. The value of a and of c is chosen to be independent of the FF, as dictated by the DLA 
result in eq. p57p . 

Data for unidentified hadron production at e~^e~' reactio ns o ver a large ra nge of values of ^/s was used, being 



composed of the sets from TASSO at = 14 35 44 GeV 11531 a nd 22 GeV [l5J, MARK II .155i[ and TPC [iM., 
at 29 GeV, TOPAZ at 58 GeV [HI, ALEPH [lH, DELPHI 1159^. L 3 flBn^ . OPAL [m^ and SLG 1162^ at 91 GeV, 
ALEPH 1163] and OPAL [l3| at 133 GeV, DELPHI at 161 GeVll39| and OPAL at 172, 183, 189 GeV [1645 and 202 
GeV [165] . These data also span a wide range in x, from large to small values, as is usually the case with inclusive 
single hadron production measurements. The level of accuracy of the calculations in Ref. jll2l ] was such that the 
possible contamination of these hadron unidentified data as discussed in section [3] was not considered a problem. 
However, such data are probably not suited to a NLO fit. The unresummed fit, i.e. using P — agP'^'^^ in eq. ([2]) fails 
at the small x values around and to the right of the maximum, as can be seen in Fig. [33] (left). On the other hand, 
the resummed i.e. using eq. I|164p for the splitting functions appearing in the evolution, of Fig. [33] (right) succeeds 
much better at these small x values, while the large x description remains intact. The choice M/ = ^/s was made, 
as well as the large value AIq = 14 GeV to ensure that the form of eq. (|172p predicted by the DLA at large Mq was 
valid. The fitted value of Aqcd in the unresummed fit was 388 MeV, which is fairly reasonable, but the resummed 
fit gave Aqcd — 801 MeV, which is somewhat larger than expected. This may not be serious since Aqcd carries a 
multiplicative error of 0(1): Multiplying Aqcd by a factor is equivalent to dividing the choices above for Mq and Mf 
by this factor, which by factorization scale independence and perturbative convergence is allowed provided that this 
factor is of 0(1). To further improve the small x description, hadron mass effects were accounted for in the manner 
discussed in subsection 15.11 In the case of the unresummed fit, the description of all fitted data was as good as the 
resummed fit of Fig. [33] (right), while the resummed fit did not improve substantially. However, the resummed fit 
gave the reasonable values Aqcd = 399 MeV and irifi — 252 MeV (assuming the hadron sample is composed mostly 
of TT^), while the unresummed fit gave the clearly unreasonable value Aqcd = 1308 MeV and perhaps slightly too 
large value nih — 408 MeV. Thus the conclusion from these analyses is that both DL resummation and treatment of 
hadron mass effects are needed in order to achieve a reasonable fitted value of Aqcd cind in order to describe the data 
down to small x values. 

The fact that eq. (|147p is expected to be valid for all ui including as w ^ suggests that the failure to describe the 
very small x data when both DLs and 0(l/s) effects are treated is due to the neglect of the complete SL contribution 
to the evolution and/or the complete DL contribution in the gluon coefficient function. There may however also be a 
discrepancy between the theoretical and experimental definitions of the cross section which become more important 
with decreasing x. 



5. The modified leading logarithmic approximation 



Equation ((^J) with P give n by eq. p64p reduces to the MLLA master equation [144| , differentiated with respect to 
the factorization scale [l66l ]. if p-SeE(o) p64p is replaced, in Mellin space, by its value at w = 0, i.e. 

P(w, a,) = asP^^°^ (w = 0) + P^^{l^, a.). (173) 
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FIG. 33: Fit to data using DGLAP evolution in the FO approach to LO (left) and using DGLAP evolution in the FO approach 
to LO but with DLs resummed (right). The variable —Inl/x. Only some of the data sets used for the fit are shown, together 
with their theoretical predictions from the results of the fit. Data to the right of the horizontal dotted lines have not been used 
in the fit. Each curve is shifted up by 0.8 for clarity. 



In discussions of the MLLA, 



P^(o)(c. = 0) 



-3Cf 



(174) 



is usually referred to as the SL term, as discussed after eq. (|143l) . At small w, the accuracy of eq. (|173p is similar 
to the accur acy of eq. (|164p . while for all w, eq. (|173p is still a better approximation for P than P — P^^ is. By 
coincidence jl43j , eq. (|173p leads to a similar result at large z as eq. (|164p or ^ do. In any case, the approach of eq. 
P64p . or more generally eq. p45p . incorporates more information than the MLLA, in particular the large uj region, 
and is therefore more accurate. 

In many applications of the MLLA, eq. I|157p is used. Together with the gluon component of eq. pSSp with 
D = {Ds,Dg), it implies that the gluon FF evolves according to 



(175) 



where, writing 



E'gg{u,as{Mf),as{M^))=e^p 



M=M; 



M=Mo 



d\nM}P'gg{uj,a,{M})) 



(176) 



the splitting function P' is given by [14 
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(177) 



As for P^^, P' (uijas) is finite as w — > 0. Furthermore, eq. (|157p reduces eq. (|170p to 



1 da'' 
(t(s) dx 



(x,s 



2Cf 



(178) 
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We have derived these MLLA resuhs using the resuhs of the DLA and the FO approximation, but they were first 
derived as a correction to the derivation of the DLA using the generating functional technique mentioned in subsection 
El 

It is often convenient and simpler to study the moments of the cross section. The nth moment of a function f{z) 
is given by 



(179) 



Kn= y-^J In/M 

According to eq. (|175p . the M'j dependence of the nth moment of f{z) — D{z, M'j) is given by 

Kn{M]) = AKMM]),a,{MS)) + K„{M^), 
where AKniasiMj), as{M§)) is the nth moment of £'gg(z, as(M|), as(Mo )), given by 



M=Mf 



(180) 



(181) 



JM=Mo \ "-^ J 

which from eq. (I177|) gives, for n > 1, 

AKn{a,{Mj),a,{Ml)) = a;"^ {M}) + CWaf (M|) + O (a,)) - {as{M]) ^ as{M^)]. (182) 



The explicit results for the Ci^'^-* can be calculated from eq. (|177p . and are presented in Ref. 167 1 for the first few 
values of n. For n > 3 and odd, Cn^ — 0. Equation p82p also applies for n — 0, but with the presence of a term 
proportional to In . 



6. Local parton-hadron duality and the limiting spectrum 

Perturbative QCD is incomplete in that it cannot describe the physics of hadrons entirely. An intuitive solution 
to this problem is provided by the LPHD, which states that the distribution of partons in inclusive processes with a 
sufficiently low energy scale is similar to the distribution of hadrons, up to the number of particles actually produced 
(the multiplicity). This implies that the hadronic FF is proportional to the partonic FF at a factorization scale of 
O(Aqcd)- In other words, assuming that eq. (|157p is valid at such low scales, we must ensure that our initial gluon 
FF obeys 

Dg{z, M^) = N6{1 - z) (183) 

if we choose Afo — O(Aqcd)- Ordinarily, evolution at such low scales will not be possible in perturbation theory, be- 
cause the convergence of the perturbation series is spoilt, and can even be singular. However, by using hypergeometric 
functions, it is possible to write the MLLA evolution of Z3g, and therefore the s dependence of the cross section in eq. 
(|178p . such that the calculation is well defined at Mf = Aqcd- This calculation is known as the limiting spectrum. 
In this subsection we show how the limiting spectrum and the result of the LPHD in eq. (|183p arise as accidental 
consequences of truncated perturbation theory. 

According to eq. (|182p and the result as(M^) — > as Af — ^ oo, for sufficiently large My we may neglect if„(Mg) 
relative to AKn{as{Mj),as{MQ)), except for n = because the cross section is very sensitive to = exp[Ko{MQ)]. 
From eq. ^T7^ with f {z) = D{z, M^) and Kn{M^) = for n > 1, we see that D{z, M^) takes the form in eq. ([TF5)) . 
In other words, at high energies, it appears as if the initial FF is a delta function, even though it could be a very 
different function. 

The second, Mq dependent, part of if„ eq. (|182p may also be neglected relative to the first, Mf dependent, part for 
sufficiently large Mf, because in this limit as{M'j) <C as(Afg). Coincidentally, because as(M^) ^ oo as M ^ Aqcd, 
the neglect of the second, Mq dependent, part of Kn eq. (I182p is equivalent to choosing A/q = Aqcd, provided that 
the series in aV^ is terminated at 0(a"^^), because the next term will be finite and the terms following that will be 
singular. In other words, at high energies, it appears as if the choice Mq = Aqcd is justified, even though it is in fact 
not. 

We conclude that the LPHD and limiting spectrum in the context of hadron production can only be verified using 
low energy data, because their consequences at high energy are also implied by perturbation theory alone. 



63 



With the two approximations above, eq. pSOp finally becomes 

Kn{Mj) « AKn{asiMj),oo) for n > 1, (184) 

which are the moments of the limiting spectrum. Equation (|184p implies an interesting large Mf behaviour of the 
gluon FF, and hence, from eq. (|157p . of all quark FFs. The inverse Mellin transform of the gluon FF, 

zDg{z,M]) ^ ^ I dujexp[Lu^]Dg{uj,Mj), 
Zm J(j 

where ^ = ln(l/z), may, by making the replacement y — iua, where 



K2{M'i), be written 
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where = exp[i^o(A^/)] (so that ^{M^) ^ N in eq. (fT^ ). where 5 = (C - O/o- with ^ ^ Ki{Mj\ 
value of ^, where the real quantity R is given by 



(185) 
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and where 



as' (Mf) l + 0[a]{Mf) 



(187) 



(188) 



Equation (|182p has been used for the second equality in eq. (|188p . Because of that property, we can expand the 
K„-dependent exponential in eq. (|187p in powers of the k„ up to the required accuracy and perform the integral for 
each term. Then, writing R as an exponential of the form 



R = exp 



.1=0 



(189) 



we find that the series in the exponent will terminate at a finite value of i when it is expanded in Os to some finite 
order, even ii d = 0{1). In particular, if «;„ = for n > 3, which is always a reasonable approximation because of eq. 
()188p . then all = so that R = 1. Therefore, from eq. (|186p . the gluon FF at large Mf and therefore the cross 
section at large ^/s will be a (distorted) Gaussian over the range fro m lar ge to small x. This justifies the choice for 
the small z behaviour of eq. (|172p used for the FFs in the fits of Ref. [ll2| . 



9. OUTLOOK 



1. Possible experimental results for the future 



Very accur ate m easurements of light charged and neutral hadron production at HERA are now possible. As noted 
in Refs. [ol, Il68l |. such data in which the hadron species is identified, which does not yet exist, would make a 
significant improvement to the current knowledge of FFs. Furthermore, together with all the similar data from e^e^ 
reactions already used in global fits, the flavour separation of the FFs for each hadron species could be significantly 
improved, and the validity of the quark tagging often performed in measurements of e~^e~ reactions can be studied. 
By identification of the charge-sign of the produced hadrons as well, perhaps the greatest improvement to the current 
constraints on FFs that HERA could provide are on the valence quark FFs, which at present are only constrained by 
similar but relatively much less accurate data from RHIC. Furth er ac curate measurements of light charged hadron 
production in e+e~ reaction data from BaBar at ^/s = 10.54 GeV jl42j |. shown in its preliminary version in Fig. (j34p . 
is expected to be published in the next year or so, which will provide much needed constraints on FFs at large z that 
are not provided by any current data sets. Because these data are taken at a different ^/s to that of the currently 
most accurate data, which is at ^/s — 91.2 GeV, such data will also significantly improve the constraints on as{Mz)- 
In addition, new more accurate pp reaction data is being taken at RHIC which includes larger pT values than before, 
and hence will also improve the large z constraints. 

In the further future, accurate measurements in e+e^ reactions may be taken at CLEO, Belle and LEP. Valence 
quark FFs could also be constrained by measurements of the asymmetric cross section in e^e~ reactions, but to a 
much lesser degree than by measurements of charge-sign identified data from ep reactions. Further ep reaction data 
could be taken at the proposed eRHIC and LHeC colliders. Finally, measurements of pp reactions at the LHC would 
likely be of sufficiently high accuracy to significantly improve the FF constraints. 
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FIG. 34: Preliminary data from e'^e reactions at BaBar for tt , K and p/p. Also shown are some published data for 
comparison. From Ref. [20|]. 



2. Future theoretical input to global fits 



A number of improvements which global fits may benefit from are possible, and we list those which are most likely 
to be available first: 

1. The inclusion of heavy quarks and heavy quark masses in the partonic cross sections. 

2. The application of large x resummation to pp reaction calculations for a finite rapidity range, and also for 
ep reactions both with and without cuts in px- 

3. The calculation of the NNLO off-diagonal splitting functions, as well as the NNLO partonic cross sections 
in ep and pp reactions. 

4. The calculation of the full DL contribution to the gluon coefficient function in e+e" reactions data, and 
possibly further SGLs here and in the splitting functions, with the incorporation of these improvements in 
global fits to e+e~ reaction data including more of the data at small x that have previously been excluded. 
The resummation of the same classes of SGLs in other types of processes may also be beneficial, if there 
exist measurements of those processes whose calculations would improve due to the resummation. 

Of course, a simultaneous treatment of all the above improvements in a single global fit would be an additional 
goal, but one which is further into the future. 

Global fits of FFs can be further improved by fitting other quantities not determinable in perturbation theory, such 
as hadron mass, higher twist contributions, intrinsic FFs, fracture functions and the modifications associated with 
hadronic decay channels, whose results in the case of the latter four quantities would contribute significantly to the 
current knowledge of QCD physics. 
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10. SUMMARY 

We have given a comprehensive account of the most important methods and resuhs in the phenomenological 
extraction of FFs from experimental data. Many FF components for many particles are now very well constrained. 
FFs have been useful in the study of other phenomena and in the extraction of other physical quantities, of which 
some examples are given in section [T] and the number and intensity of such applications is expected to increase as the 
constraints on FFs improve. The best constraints on FFs at present come from e+e~ reaction data However, these 
data do not constrain the valence quark FFs (or charge-sign asymmetry FFs), and to a large degree the gluon FF, 
which can be at least weakly constrained by current data from RHIC. Theoretically, calculations of inclusive single 
hadron production in all reactions that have been performed experimentally can be calculated to NLO, and the effects 
of large x resummation in e+e~ reactions and heavy quarks effects in all reactions can be included at this level of 
accuracy. The deviation of the fitted detected hadron mass from its true value gives a quantification of the importance 
of the production of this particle from decays of heavier hadrons instead of from direct partonic fragmentation. In 
global fits, systematic errors are now accounted for through a covariance matrix, and various sound approaches for 
propagating experimental errors from measurements to predictions have been applied. 

However, there is alot of room for improvement in this program, perhaps the most important being the eventual 
incorporation in global fits of future accurate measurements of hadron production at HERA in which both the species 
and charge-sign of the hadrons is measured, in order to significantly improve the constraints on the differences between 
FFs of different quark flavours (the non singlet quark FFs) and of different quark charge-signs (the valence quark 
FFs) respectively. Such FF components are currently constrained by theoretically sound but experimentally untested 
non perturbative assumptions. A significant improvement in the extraction of as{Mz) would also result from such 
fits relative to previous fits. Because of the theoretical similarity between charge-sign unidentified data from e^e^ 
and ep reactions, further tests on FF universality would also result from the inclusion of hadron species identified 
data from HERA in global fits together with data from e+e~ reactions. 

Note that FF universality tests are provided by the inclusion of pp{p) reaction data, but, because of their much 
lower accuracy, to a much lesser degree than by the inclusion of ep reaction data. At present, pp{p) reaction data are 
crucial for the extraction of the currently very badly constrained gluon and valence quark FFs, and therefore future 
accurate measurements from RHIC and the LHC will be most welcome. We note that the constraints on these FFs 
could also be significantly improved by measurements of, respectively, the longitudinal and asymmetric cross sections 
(with hadron species identification, of course) in e"'"e~ reactions. 
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APPENDIX A: DERIVATION OF THE QCD FACTORIZATION THEOREM 

In this appendix, we discuss the factorization theorem applied to inclusive single hadron production. We will restrict 
our discussion to the process e+e^ ^7* h + X of Fig. [U although it can be applied to other inclusive single hadron 
production processes. This cross section can be decomposed as 

da = L^-'W^,, (Al) 

where L^'^ is the well known tensor describing the process e+e^ 7* (we work to LO in QED), while the tensor 

W^.{q,Ph) = ^J d^x^'^-^0\J^{x)al{ph)\X){X\ah{ph)MOm, (A2) 

where a'j^'^ (ph) is the annihilation (creation) operator for a hadron h with momentum ph (we do not assume a massless 
hadron in this appendix), describes the hadronic process 7* — > /i -1- A" and is the quantity in which we are interested. 
W may be partially calculated using perturbation theory by decomposing it into the form (omitting the spacetime 
indices jiu for brevity from now on) 

W ^wD + r, (A3) 
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where the remainder r is power suppressed, i.e. 

where w is a vector of the equivalent, factorized, full partonic processes Wi (where the parton species label i includes 
(combinations of) the spin state(s) and charge(s) as discussed in subsection 12. 6p for the processes ^* ^ i + X, and 
where D is a vector of factorized FFs Di. Equation (|A3p is then the same as eq. H]). The "product" wD involves sums 
and integrations over all species of the "detected" final state parton of w, which is a real particle moving spatially 
parallel to the detected hadron. Note that the momentum integration is precisely the convolution of eq. ([T|), i.e. over 
the momentum fr actio n z, and not over all four components of momentum. In the language of the operator product 
expansion (OPE) [169| . eq. (|A3p corresponds to a twist expansion, the term wD being referred to as the leading twist 
component. The variable p > 1 in eq. (|A4p . so at large enough energy the higher twist terms are smaller than the 
radiative corrections of 0(1/ In y^). 

In the remainder of this appendix, we briefly but comprehensively outline the formal derivation of the factorization 
theorem of R ef. l33l| . in which full details and more references may be found. In subscction lA.A.ll we outline the steps 
given in Ref. [33| for separating the non leading twist part, whose order of magnitude can be reliably estimated, from 
the cross section such that it is suitable for factorization. In subsection lA.A.2l we discuss the factorization approach of 
Ref. (ssj . which is essentially a generalization of former appr oaches. Then, in subsection lA.A.31 we attempt to connect 
it with the older factorization approach of Refs. [H, Il7d | . Because the manner in which a parton should be treated 
in factorization depends on whether its mass is much greater than the hard scale, diff'erent schemes are appropriate 
for different energies and therefore, in subsection lA. A.5l we discuss matching conditions between quantities in these 
different schemes. However, we note there that the correct treatment of quarks with mass much greater than the hard 
scale has not been specified in the literature so far, and we indicate how this may be remedied. Finally, in subsection 
IA.A.6[ we summarize the open issues remaining in the factorization theorem. 



A.l. Twist expansion 



For now, let us assume that all partons have masses less than or of the order of the hard scale ^/s. We will return 
to the case that there are also quarks with masses much greater than -^s in subsection I A. A.4l In Ref. [s^], the form 
of eq. (|A3P is derived for DIS, eh ^ e + X (in which case D is a vector of PDFs), by starting from an expansion 
in graphs which are two-particle irreducible (2PI), i.e. cannot be disconnected by cutting through two internal lines, 
in th e t-channel, and then using the result that all graphs that contribute at leading twist are two-particle reducible 
[l7ll |. There are many subtleties involved in getting to this result, and the reader is referred to subsection IV A of 
Ref. [3^ for details and references. By repeating these arguments but with the negative virtuality of the exchanged 
boson replaced with positive virtuality and the incoming hadron replaced with an outgoing hadron, one obtains the 
factorized form of the cross section e'^e~ — s- 7* — > /i -I- The expansion is shown in Fig. [35] which, using the 
shorthand 1 + Kq + Kq + . . . = 1/(1 — Kq), may be represented as 

W = Co-^-^To + B. (A5) 

As it stands, eq. (jA5[) is no use for practical calculations. Firstly, Tq and B contain hadron legs for which no 
perturbative or other analytic representation exists. Secondly, perturbation theory cannot be applied even to calculate 
the equivalent purely partonic cross section Cq/{1 — Kq): In general, each parton species contributes potential mass 
singularities, which are logarithms of each parton mass that becomes singular as that mass approaches zero. The 
potential mass singularities of any quark, then, will not be singular if all quarks are taken to be massive, but they 
will be large if this quark's mass is much less than ^/s, i.e. in the limit that this mass be neglected, at the expense 
of a relative error of 0{mf / s) on the cross section as dictated by the decoupling theorem [2T|. Because the magnitude 
of potential mass singularities increases with the order in a^, for sufficiently large s they will cause the perturbative 
calculation of any part of W to diverge. Thirdly, the products in eq. (jA5[) are rather complicated, containing sums and 
integrations over all virtual partons connecting 2PI graphs, which we will call connecting partons. To tackle the second 
and third problem, we begin with the observation that, in a physical gau ge such as the light-cone gauge, which we use 
from now on, the 2PI graphs are free of potential mass singularities |l70j | , which therefore arise from those connecting 
partons which are in the vicinity of being real, i.e. on shell and with physical spin, and moving spatially parallel to the 
detected hadron. These 2PI graphs are also free of UV singularities after renormalization of the strong coupling and 
the quark masses. Thus we introduce a projection operator Z = Z'^ which projects onto the connecting partons in a 
manner which includes all those partons with momenta responsible for the potential mass singularities. The operator 
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FIG. 35: 2PI f-channel expansion of the cross section for e^e~ — > 7* ^ /i + X (with the electron and positron legs removed for 
simplicity). The left hand side is the modulus squared of the amplitude shown in Fig.[T] i.e. the external wavy legs represent the 
virtual photon 7* and the external double lines the produced hadron h. On the right hand side, the lines between 2PI graphs 
are virtual partons in loops (i.e. summed and integrated over all quantum numbers). Co and Kq represent all 2PI t-channel 
contributions to the processes 7* — > j + X and i ^ j + X respectively. B and To represent all 2PI t-channel contributions to 
^* —> h + X and j —> h + X, where in this case "2PI" must be defined by non perturbative external field methods. Strictly 
speaking, all graphs have a cut down the middle signifying the final state and on which all particles must be real. Alternatively, 
we may forget the cut if these graphs may be regarded as forward amplitudes, whose imaginary component also gives the cross 
section as dictated by the optical theorem. 

{1 — Z) annihilates the potential mass singularity between any two 2PI graphs. Let us now give one explicit definition 
of such an operator. We use light cone coordinates V = {V+ = {V° + V^)/V2, V- = (y° - V^)/V2, Vt = {Vi, V2)), 
and work in a frame for which the detected hadron's momentum is given by eq. ((84|) and the virtual boson's momentum 
by eq. (|83p . Considering only massless partons for simple illustration, mass singularities arise from connecting partons 
with physical spins and physical momenta obeying 

fc=(^^,0,oy (A6) 

Then 

^o«';/3,;3'(fc,0 = ^T-a'T^;,' (27r)4<5(fc+ - /+ )J (fc" (fc^) (A7) 

and something similar for gluons. Note that eq. (jA7p is consistent with the projection operator property Z = Z^. The 
extension of eq. (jA7[) to massive quarks (we assume all quarks' and the detected hadron's masses are non negligible) 
is quite straightforward, in particular the "— " component of eq. (|A6|) becomes zmf / {2p^) and the mass singularities 
become potential mass singularities. Like Kq, Z can be regarded as a graph with two partons at the top, each 
with momentum fc, and two with momentum / at the bottom. As illustrated in Fig. 1361 the insertion of Z between 
two graphs reduces the sums and integrations in the product between them to an integration over z of the pair of 
connecting partons at the bottom of the top graph, which have momenta given by eq. (jA6p . i.e. they are on-shell, and 
an integration over the momenta and It of the pair of connecting partons at the top of the bottom graph, whose + 
component of momentum is the same as that in eq. (jA6p . i.e. Z+ = Ph This is the case, for example, in the product 
between wD in eq. (|A3p — in practice it is usually not necessary to show explicitly the and It integrations over 
the initial state partons in D. 

The Z operator is used to put eq. (|A5p into the form of eq. (jA3p in subsection V B of Ref. [l^ , where the remainder 
r is shown to be power suppressed (i.e. to obey eq. (|A4p ). A general graph, such as W, containing a hard and a 
soft scale such as and Aqcd respectively will in general contain a leading region, i.e. a part which is not power 
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FIG. 36: The action of Z on the connecting partons of mass rrii between any graphs A and B. The momentum integration is 
over the variables and It- 



suppressed, coming from graphs which can be expressed as a graph coupled to the hard scale, the hard grap h joined 
to a graph coupled to the soft scale, the soft graphs by 2 partons of mass much less than the hard scale [l7ll |. Being 
2PI, B docs not contain such graphs and therefore does not contain a leading region, i.e. it is power suppressed. This 
will not be the case in a general gauge, because additional gluon exchanges between the hard and soft graphs are 
allowed. However, it is assumed, without proof, that a suitable light-cone gauge exists such that such graphs are 
power suppressed. This is true for the = gauge that we have been using, at least in the case that these gluons' 
momenta are almost coUinear with the detected hadron's. In the leading region, lines in the hard graph will have 
much larger virtualities than lines in the soft graph, so lines in a 2PI graph must have similar virtualities. Therefore, 
because each (1 — Z) insertion between two graphs gives a suppression of order (highest virtuality in graph below / 
lowest virtuality in graph above)^ [s^, successive insertions of (1 — Z) in between all 2PI graphs in eq. (|A5p will give 
successive factors of this suppression leading an overall power suppression. Consequently, 

r = C.{l-Z)^-j^^^T, + B (AS) 

obeys eq. (|A4p . so r may be taken to represent all higher twist terms in W . All potential mass singularities in r 
are annihilated by the (1 — Z) insertions. All UV singularities in r cancel: Acting on partons from the left with Z 
introduces a UV singularity, which is then canceled by the 1 ~ Z factors in essentially the same way as these factors 
remove the potential mass singularities. 

The leading twist component of W is therefore W — r, which we will now factorize in order to express it in its 
familiar form in eq. (|A3[) . 



A. 2. The modern approach to factorization 



The approach to factorization of Ref. ,33,] that we consider in this subsecti on i s a generalization of previous ap- 
proaches such as the OPE. Most importantly, it implicitly uses the cut vertices [l72j | and nonlocal operators 141 1 that 
were necessary in modifying the Wilson expansion in local operators on the light cone, which can be applied to e.g. 
the time ordered product of currents appearing in ep —* e + X , to the time ordered product of currents appearing in 
e+e^ h + X, which is subtle due its nonlocal operator structure. The result that r = 0((Aqcd/Q)^), i-e. that p = 2 



m eq 
Ref. l3 




found in Ref. 



14lj using an expansion in nonlocal operators, should be obtainable using the approach of 
. Using the Z operator, eq. (|A5[) may be rewritten in the form 



W ■ 



wbDb- 



(A9) 
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In this expression, the vector 



WB = Co 



1 



1 - (1 - Z)Ko 



(AlO) 



contains the equivalent "UV bare" (containing UV divergences) partonic processes. It is clearly free of potential mass 
singularities. Note that the "detected" parton is real due to the presence of Z on the right hand side. The UV bare 
FFs 



1 



l-K, 



-To 



(All) 



appearing in eq. (|A9[) describe the fragmentation of partons. Clearly, Db contains only and all processes contributing 
to the process i h -\- X (i is in fact a virtual parton, but its + component of momentum is fixed to that of the real 
parton lines at the top of Z, while its remaining components are integrated ou t), and rnay t herefore be written as a 
matrix element of unrenormalized operators. In a general gauge, it is given by [43 . 1 1731 . Il74| 



dx e 



2 trcolor 2 ^^spi] 



7+(0|<H0,0,O)^exp 



*5 



(0) 



X 4(P+,0,O)a,,(P+,0,O)5^exp 



(0) 



dy-AW+(0,y~,O)C 



dy-AW+(0,2/-,0)C 



<\o,x-,o)|o)^. 



(A12) 



for a quark of flavour qi, where ■0^^ is the spinor field for quarks of flavour /, g is the strong coupling, ta — Xa/'i 
where Xa are the Gell-Mann matrices, and is the field for gluons of colour charge a. The operator ^ (^) orders 
the field operators A+(0,?/~,0) in the products in each term of the expansion of the exponential in g such that an 
operator is always to the left (right) of any other operator with a lower (higher) value of . The superscript "(0)" 
on e.g. ipq^^ implies that the object is unrenormalized. The expression for the gluon FF is similar, obtained essentially 
by replacing ipq^^ everywhere with the gluon field strength tensor. The Wilson lines are not present in eq. (jA12|) in 
the = gauge that we have been using, i.e. the exponentials are equal to unity. Their direct derivation involves 
going beyond the 2PI expansion in a non-trivial way. Since the UV divergences of the bare operators are subtracted 
by ordinary multiplicative renormalization, the renormalized FFs take the form 



D = GDb, 



(A13) 



where G = GZ renormalizes UV divergences in the operator in Db, in the standard way of operator renormalization 
(see e.g. |l75l |). and therefore introduces a dependence of D on some operator renormalization scale Alf. G is non 
singular as parton masses vanish, as will be explicitly shown later (in the sentence containing eq. (jA25|) ). Dimensional 
regularization is necessary to implement the CWZ schemes, in which case G will be singular as the number of 
dimensions approaches 4. As usual in operator renormalization, the dependence of G on Mf is governed by the 
renormalization group equation 



G^PG, 



(A14) 



where the components of the matrix P are the relevant perturbatively calculable anomalous dimensions for the 
operators in Db, and are known as splitting functions. Therefore from eq. (|A13[) . 



din A/2 



D = PD, 



(A15) 



which is the DGLAP equation. In studies using the OPE, eq. (jA15[) appears in Mellin space, where the Mellin space 
variable TV is an integer equal to the spin of the given operator in the OPE, which is equivalent to eq. (|A15P in z 
space that we have just derived by applying the inverse Mellin transform of eq. (jCSp . 
We finally arrive at eq. (|A3[) from eq. (|A9|) by making the identity 



wbG 



(A16) 



where G ^ is defined such that GG ^ = Z. As for wb, the factorized partonic processes w are also free of potential 
mass singularities. In practice, w can be calculated as follows: We may write 



Wq = WbKb, 



(A17) 
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where 

1 - Aq 

is precisely the equivalent partonic process in which the "detected" parton is real, which is clearly free of UV diver- 
gences, but contains potential mass singularities, and 

Kb = Z-^—Z (A19) 

1 - Aq 

are the bare FFs for partons to fragment to partons. The renormalized equivalent of these FFs is 

r = GKb, (A20) 

which is non singular as the regulator of UV singularities is removed, but is singular as any parton masses vanish. 
Note that Kb and F are written in Ref. (ssj as Abp and Ajip respectively. Then, instead of calculating w via eq. 
(jA16p . w may be explicitly calculated from wq according to 

w = woT~^, (A21) 

where F"-"^ is defined such that FF~^ = F~^F = Z . Equation (|A21[1 . which is equivalent to eq. (65) of Ref. [33] (after 
expanding in a^), makes it clear that the purpose of F is to remove from wq the potential mass singularities. In other 
words, each potential mass singularities in is canceled by counterterms in F. F may be chosen, through the choice 
of G and Z in eq. (|A20p . to depend on parton masses only through mass logarithms, which will be the case for the 
CWZ schemes. The precise form of this F (eq. (jA34p ) will be derived later. 

To summarize, w and the Mf dependence of D are perturbatively calculable. The components of w are the standard 
coefficient functions in the literature, up to electroweak couplings etc. Like the Db, the factorized FFs D in eq. (jA26|) 
are clearly universal, since all details of the initial state are contained in Cq. 



A. 3. Connection with the older approach to factorization 

Finally, we make some connections between the above approach and the earlier approach of Refs. [H, 0, Il70f . The 
latter approach was formulated for the case of massless quarks only, so we will need to make some modifications to 
account for massive quarks as well. In addition, our definitions here of bare FFs will need to differ from the earlier 
approach in order to obtain reliable results. We begin with an alternative to eq. (|A9[) which corresponds to the 
starting point of calculations in the literature (i.e. eq. ()A21|) ): 



where the "bare" FF in this case is 



W-r^woDo, (A22) 

which is free of UV singularities because the UV singularity introduced by acting from the left with Z is canceled by 
(1 — Z) insertions occurring to the right, similar to the cancellation of UV singularities in r discussed in the paragraph 
following its definition in eq. (|A8[) . We define a projection operator 3^ = which projects in the same way as Z 
on parton lines above (so that Z!^ = !^), and which projects in a similar way as Z on parton lines below except 
that, in contrast to Z, it becomes sufficiently suppressed for increasing momenta of these lines such that it does not 
introduce UV singularities. The scale at which this suppression sets in is called the factorization scale, which will 
be written Mf since it will turn out to be identical to the operator renormalization scale defined above. Thus, for 
example, we may choose 3^ (in the case of massless quarks for simple illustration) to be given by eq. (|A7p multiplied 
by a function /(fc^/Af/, fer/Af/) such that /(0,0) = 1 and /(oo,oo) = 0. A more explicit definition of (or /) is 
not required since an implicit definition will be given later. Note that, like Z , is flavour diagonal. As a projection 
operator, = 3^ , and insertion of 1 — ^ between two 2PI graphs annihilates the potential mass singularity due to 
the connecting partons. 

The momentum dependence of 3^ can lead to problem s with gauge invariance and with rapidity divergences |l76j | . 
Such problems were not considered in Refs. [3^ [i^ ri70| |. which was limited to the case of massless quarks. We will 
assume that some suitable choice for the remaining degrees of freedom in ^ exists such that these problems do not 
arise in the case of massive quarks. Further work is needed here to prove that such a ^ exists. 
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The dependence of F, which was introduced in eq. (|A20[) . on ^ is 

r = (l + ■^^OT^i^) ^' (^24) 

which can be interpreted as the FFs for hard partons fragmenting to on-shell soft partons. Using eq. (|A20|) . we may 
write 

G^(l + (^-Z)i^„^-^^l^)z, (A25) 

i.e. G is free of potential mass singularities because, in the [Z — ^) operator, those potential mass singularities 
projected out by Z are identical to those projected out by 3^. This behaviour was noted just after eq. (|A13p . 
According to eqs. (jA3p and (|A2ip . the factorized FFs defined in eq. (|A13p may also be written 

D = TDq. (A26) 

Using eq. (|A21|) . the hard partonic cross section in terms of ^ reads 

- = ^° l-(l-^)i.o ^- 

which is clearly free of both UV singularities and potential mass singularities. F = ZT = TZ is a function only of the 
ratio of the + component of light cone momenta of its initial and final partons, as well as of parton masses and indices. 
By giving this function explicitly, eq. (|A24[) defines 3^ implicitly. The simplest possible choice for F, which is the case 
for the CWZ schemes, is that for which the coefficients in the perturbative series for F are themselves finite series 
in In A/|/m|, where i runs over all partons. Although the gluon mass must be zero for renormalizability, connecting 
gluons can be given a small mass after renormalization has been performed within the 2PI graphs. Although the 
perturbative approximation for F fails in the numerical sense, its Mf dependence does not, because 

^ F = PF, (A28) 



dlnM| 



where the splitting functions 

^-(dh^^)^° l-(l-^)i.o ^ 

are clearly free of UV and potential mass singularities. This is obviously true since the perturbatively calculable w 
must obey a similar equation to eq. (|A28|) in order for eq. (|A3[) . in particular wD, to be independent of Mf. Thus, 
when F is chosen to depend on parton masses only through finite powers of potential mass singularities, as discussed 
in the paragraph preceding eq. (|A28p . P will be independent of all parton masses. The DGLAP equation, eq. (jA15[) . 
may be obtained from eqs. (jA26|) and (|A28[) . 



A. 4. Treatment of non partonic quarks 

Up to this point, all results are only valid when no parton mass is much greater than -y/s. Because the potential 
mass singularities of quarks that do not fall into this category are power suppressed by 0{s/mf), where rrii is the 
mass of any such quark, they must not be subtracted by a large counterterm, i.e. they must be treated differently. 
Thus partons must be distinguished according to how they are treated: the partons that are treated as discussed in 
the last 3 subsections are called active partons, while the rest, including in particular those for which 3> \/s, are 
called non partonic quarks. Note that a quark for which m.; — O(y^) can be treated as either active or non partonic, 
since its potential mass singularities and their corresponding counterterms are not large. We define the 2PI graphs 
to be 2PI in non partonic quarks as well. The choice of Z on non partonic quarks must be such that r remains both 
power suppressed and free of UV singularities, i.e. Z on non partonic quarks must be such that the arguments around 
eq. (IA8[) still hold. Choosing Z — when acting on non partonic quark lines as is done in Ref. [33| leads to UV 
singularities in r, and therefore is not a valid possibility. One possibility which is valid is to choose Z{k, I) when the 
magnitude of I is large to behave in the same way on non partonic quark lines as on active quark lines, i.e. to choose 
eq. (|A7[) to hold for non partonic quarks as well, thus guaranteeing the cancellation of UV divergences in r, while the 



72 



Z{k, I) on non partonic quark lines vanishes as I becomes small to both guaranteeing the power suppression of r and 
ensuring no large counterterms are introduced for potential mass singularities due to non partonic quark, which are 
suppressed or not large. In other words, Z on non partonic quark lines may be chosen in a similar way to how 13'' 
acting on active quark lines was in the paragraph following eq. (jA23[) . except that now f{k~/Mf, kx/Mf) is replaced 
with another function, g{k^ / fif, kx/ fJ-f) say, where /i/ is another arbitrary scale at which the suppression of Z acting 
on non partonic quark lines sets in, which obeys g(0, 0) = and g{oo, oo) = 1. In this case, Z would no longer be a 
projection operator, and r is now scheme and scale dependent, i.e. it depends on the number of flavours and on fif 
respectively. Neither of these two facts presents any problems. However, as noted in the paragraph preceding the 
paragraph containing eq. (|A24[) . introducing a momentum dependence into a projection operator can lead to problems 
with gauge invariance and with rapidity divergences, so further work is needed to prove that a Z exists for which 
these problems do not arise. 

For now, for simplicity, to avoid possible problems with gauge invariance and with rapidity divergences, and to 
ensure that r is free of UV singularities, we will choose Z on non partonic quark lines to be the same as Z on active 
quark lines (i.e. the massive quark equivalent of eq. (jA7P ). However, in this case r is no longer power suppressed. 
Then, in the presence of non partonic quarks, all results derived so far still hold, except that the sum over partons 
in all products such as that in eq. (jA3p includes non partonic quarks. In certain "physical" schemes, defined in 
subsection 12.71 to be schemes such as the CWZ schemes in which no subtraction is made on quantities which are 
free of UV singularities, G depends on non partonic quark masses only, and is singular as such masses vanish (recall 
from subsection I A . A . 2] that G is non singular as active parton masses vanish), while T is independent of non partonic 
quark masses. When acting on non partonic quark lines, ^ must be chosen such that it does not introduce large 
counterterms to cancel power suppressed potential mass singularities, e.g. can be chosen to vanish on non partonic 
quark lines. 

The other type of potential mass singularity that a non partonic quark contributes, namely logarithms of its mass 
rugj that become singular as uigj ^ y/s are absorbed into the strong coupling constant in e.g. a CWZ scheme. Note 
that, for active partons, for which mi ^ y/s, such logarithms are power suppressed by Oirnf/s). 

In calculations of light hadron production, it may or may not be possible to neglect the contribution of intrinsic 
fragmentation, namely the non partonic components in the product in eq. (|A3p . In the case of PDFs, according to 
the decoupling theorem those graphs that contain a non partonic quark i will be suppressed by a power of Aqcd /"t-i 
[2^ . which is therefore the relative error on the cross section due to the neglect of the intrinsic PDF of parton i. If 
such a suppression occurs also for FFs then, as for PDFs, Dn^+i can be neglected, in which case eq. (jA30p implies 
that D'^^j^i is determined entirely from the Di for i = 0, . . . ,n/. This component of -D^i^+i describes the extrinsic 
fragmentation of parton n/ + 1. However, the results of Ref. [2^ only apply to the case of local operators, such as 
those relevant to DIS, but not necessarily to FFs for which non local operators must be used, as was mentioned earlier. 
If the intrinsic fragmentation of a non partonic quark is deemed important (e.g. charm fragmentation in a 3 flavour 
scheme) , calculations would need to be made in which Z acting on non partonic lines takes the required form. In the 
meantime, a quark whose intrinsic fragmentation is large must be treated as an active parton, with an intrinsic FF 
that can be fitted to experimental data as is the case for the light partons, and thus ^/s cannot be too small relative 
to its mass. 



A. 5. Matching conditions 



Next we consider the relation, or matching conditions, between D, and w, in schemes that differ by the number 
of active partons. Taking all quantities above to be defined for Uf active quark flavours and using primes to denote 
quantities defined with + 1 active quark flavours, eq. (|A13p implies that the matching conditions for FFs and 
coefficient functions is, respectively, 



D' = AD. 



(A30) 



and 



(A31) 



where A — G'G ^ — FT ^ is a function only of the ratio of the + component of light cone momenta of its initial and 
final partons, as well as of parton masses and indices. Diagrammatically it is given by 



A = Z 



1 



1 

l-ifo(l-^) 



Z. 



(A32) 
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Note that A remains non singular as the regulator of UV singularities is removed, in contrast to G, and, as for 
G, contains no potential mass singularities due to active parton masses. G can be chosen such that A depends on 
non partonic quark masses only, and only through mass logarithms. This is the case in the CWZ schemes. Recall 
that some modification of our incomplete results in this subsection are needed to ensure that r is properly power 
suppressed. Again, similar to P above, if T is chosen to have the simplest possible form as discussed in the paragraph 
preceding eq. (|A28|) . the only parton mass that A can depend on is m„^+i. Since the series for Kq starts at 0{as), 
since — projects out a potential mass singularity from the n/ + 1th quark, and since n factors of Kq contain 
n — 1 pairs of connecting partons and therefore a product of n — 1 potential mass singularities, A in Mellin space and 
as a matrix in parton species takes the form 

A{N,Mj) = 1 + Y. E ^™^(^) 1^'" (A33) 

n=l m=0 "'*n/+l 

From this we obtain the simplest possible form for F: 

r = n 1 + E J2 ^1:^ (N) In" ^ . (A34) 

1=0 \ Ti=l m=0 * / 

In practice, the are chosen such that w in eq. (IA21|) is finite in the limit that active parton masses vanish 

(although these limits do not of course need to be taken in the actual cross section calculations). The CWZ scheme 
is obtained by choosing the Am'' such that w in eq. (jA21|) reduces to the MS scheme for a theory with Uf massless 
fiavours only in the limits that active parton masses vanish and non partonic masses approach infinity (which again 
do not need to be taken in actual applications). 



A. 6. Open issues 



We have given a brief summary of the current status of factorization which was developed in Ref. [33l| . This approach 
both generalizes earlier approaches and solves a number of issues therein. However, some issues remain. In particular, 
the modification that we have proposed here to the behaviour of Z on non partonic quark lines given in Ref. I33l. as 
well as our proposed relations between the results of Ref. [3^ and those of the earlier approach of Refs. [ssl. l43l . [l7C| in 
terms of need more explicit study in order to ensure that there are no problems with gauge invariance or rapidity 
divergences, which are a general consequence of momentum dependent projection operators. Furthermore, while it 
has been proved that graphs in which there is an exchange between the hard and soft graphs of gluons coUincar to 
the detected hadron are power suppressed, a proof is lacking for gluons which are not coUincar. 



APPENDIX B: LEADING ORDER SPLITTING FUNCTIONS 



The LO coefficients of the splitting functions are given by 



1- z 



+ 2-5(1 - z) , 



Pil\z) = 2Cp 

3(0) 



1 + (1-Z)2 



(Bl) 



P^'^{z) = Tnnfiz' + {l-zr), 
p(0)^z)^2cJ^-2 + z-z' 



1 - z 



where Tr = 1/2 and, for the color gauge group SU(3), Ca — 3 and Cp — 4/3. The [/(z)]_|- operation occurs often in 
perturbative calculations, and can be most usefully defined by its behaviour in a convolution (in which it will always 
appear) : 



/' - [/(z)]+ D (^) = r ^f{z) \d (^) - zDix)] D{x) I 
Jx z ^ \zJ z I \zJ J Jo 



''^ dz 



-zD{x) -D{x) / dzf{z) 



(B2) 
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for any functions f(z) and D{z). Supposing that /(z) may contain a singularity as z ^ 1 but nowhere else, eq. (|B2[) is 
non singular provided f{z) is less singular than 1/(1 — z)^, because the quantity in square brackets in the first integral 
on the right hand side falls to zero as fast as (1 — z) in this limit. On the other hand, without the []+ operation. 



eq. (|B2p is only non singular provided f{z) is less singular than 1/(1 — z). By choosing D{z) 



-N 



common factor 



-N 



removing the 



and then replacing the lower limit of the integration on the left hand side with 0, we find the 



Mellin transform of [/(z)] , to be 



([/(z)]^) (TV) = / dz(z^-i-l)/(z). 
Jo 



(B3) 



Transforming eq. (|Bip to Mellin space gives 



pg)(7V) = 2Cf 
PW(7V) = 2Ca 



1 



2 N{N + 1) 
+N + 2 
{N~ 1)N{N + 1)' 

+N + 2 
N{N +1){N + 2)' 
11 1 



2SiiN) 



(B4) 



1 



12 N{N- 1) {N+l){N + 2) 



-Si{N) 



where, for integer N, the harmonic sum 



^ 1 



(B5) 



fe=i 



Equation (jB5[) can be analytically continued to complex N ^ —1, —2, . . . [69| by making the replacement X]fc=i 
Sfe^i ~ J2k=N+i then making the replacement fc ^ fc — in the second sum. The result is 



N 



This sum converges, but rather slowly. For numerical work, Si{N) should be calculated using the result [6£ 

S,{N)^Si{N + r)^j2TT 



(B6) 



fc=i 



k + N 



(B7) 



that follows from eq. (IBSp. where r should be chosen such that | + r| is large, and then calculating Si{N + r) as an 
expansion in 1/{N + r) 177l | . 



APPENDIX C: MELLIN SPACE 



Any succession of convolutions, which may be written as 



Zi J^_^ Z2 J^^_^ Zn-1 \Zi J \Z2 



is converted by the Mellin transform, defined by the integral transformation 

fiN) = -z^/(z). 



into an analytically more manageable succession of products 

fiN)^h{N)MN)...UN), 



(C2) 



(C3) 
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which is most easily proved by applying the Mellin transform to an alternative form of eq. (jCip . 

f{z)^ / dzi dz2... dZnS{z - Z1Z2 ■ ■ ■Zn)fl{zi)f2{z2) ■ ■ ■ fn{Zn)- (C4) 
Jo Jo Jo 

The Mellin transform is invertible via the inverse Mellin transform 



/(^) = 7^ f dNz~^f{N), (C5) 



where C is a contour in complex A'' space which starts from a point at Im(A'') = —00, ends at a point at Im(A'') = 00, 
and passes to the right of all poles in f{N). According to Cauchy's theorem, the contour C may be deformed 
provided that it does not pass through any poles in the process. The numerical evaluation of the integration in eq. 
(jCSP converges fastest when a contour for which Re(— A^) ^ cx) is used, because then falls exponentially to zero 
along it. 

Often in a convolution of two functions, 



(?) 


=-/ 







dNz-''g,{N)g2{N), (C6) 



"1 dz' 



the analytic Mellin transform of one of these functions, gi say, may not be calculable. (Note that eq. (jC6|l includes 
the cases of multiple convolutions in eq. (jCl[) , since gi can be equated with a subset of the convolutions in eq. (jCl|) 
and 52 with the rest.) An example is the i^^^/j^ appearing in eq. ([79|) . In this case the Mellin transform of gi = F^-^i^^ 
may be performed numerically, but only after dealing with a subtlety: Equation (jC2p with f ^ gi implies that the 
second equality in eq. (|C6[) has a divergent contribution proportional to dN {z / z')^^ g2{N) whenever < z' < z, 
if the contour C is chosen as discussed at the end of the previous paragraph. However, inspection of the first equality 
in eq. (jC6|) reveals that /(z) is independent of gi{z') in this region. Thus 51(A) in the second equality in eq. (jC6p 
must be replaced with the modified Mellin transform 

g^{N-z)^ / ^z'%(z'). (C7) 

J z ^ 

It can sometimes happen that the analytic Mellin transform of the other function, 52, also cannot be obtained. 
Unfortunately, eq. (|C7p cannot be used to also calculate the Mellin transform of 52, because then the inverse Mellin 
transform does not converge: As well as eq. (|C7I) . assume a second result which is the same as eq. (IC7|) but with 
gi 52 and z' z" . Then eq. (jC6P has a divergent contribution proportional to /^^ dN{z/{z' z"))~^ from the region 
z'z" < z, which exists even though z' > z and z" > z. One solution is instead to approximate 52 in z space as e.g. an 
expansion in Chebyshev polynomials, and then analytically Mellin transform it. 
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APPENDIX D: SUMMARY OF INCLUSIVE SINGLE HADRON PRODUCTION MEASUREMENTS 

Only TT^, K^^ p/p, Kg and A/ A data which can be reasonably reliably calculated, and which are therefore suitable 
for global fits, are listed here. 



D.l. e'^e reactions 



TABLE 3: Summary of the measurements for inclusive 
single TT* production in e+e~ reactions. The column 
labeled data" gives the number of data for which 
X > 0.05. The last column refers to the normalization 
uncertainty on the data. 



TABLE 4; As in Tabled but for K^. 



Collaboration 


Tagging 


V 

(GeV) 


II 

data 


Norm. 

(%) 


TASSO fl78l 


untagged 


12 


5 


20 


TASSO fllSl 


untagged 


14 


10 


8.5 


TASSO fllSl 


untagged 


22 


1 


6.3 


HRS d79l 

J. J. J. VkJ 1 -L 1 t/ 


untagged 


29 


6 




TPC f55l 




29 


9 




TPC f55l 


c tflp'p'prl 


29 


9 




TPC f55l 


b tagged 


29 


9 




TPC f89l 


untagged 


29 


27 




TASSO fl78l 

_1_ J 1. kJ kJ V,./ 1 _L 1 


untagged 


30 


4 


20 


TASSO [90] 


untagged 


34 


10 


6 


TASSO [90] 


untagged 


44 


7 


6 


TOPAZ [157] 


untagged 


58 


8 




ALEPH [79] 


untagged 


91.2 


22 


3 


DELPHI [56] 


1 tagged 


91.2 


17 




DELPHI [56] 


b tagged 


91.2 


17 




DELPHI [56] 


untagged 


91.2 


17 




OPAL [58] 


u tagged 


91.2 


5 




OPAL [58] 


d tagged 


91.2 


5 




OPAL [58] 


s tagged 


91.2 


5 




OPAL [58] 


c tagged 


91.2 


5 




OPAL [58] 


b tagged 


91.2 


5 




OPAL [180] 


untagged 


91.2 


20 




SLD [181] 


1 tagged 


91.2 


28 




SLD [181] 


c tagged 


91.2 


28 




SLD [181] 


b tagged 


91.2 


28 




SLD [181] 


untagged 


91.2 


28 




DELPHI [182] 


untagged 


189 


3 




Total 






338 





Collaboration 


Tagging 


^/5 
(GeV) 


# 
data 


Norm. 

(%) 


TASSO [178] 


untagged 


12 


3 


20 


TASSO [113] 


untagged 


14 


9 


8.5 


TASSO [113] 


untagged 


22 


7 


6.3 


HRS [179] 


untagged 


29 


7 




MARKII [183] 


untagged 


29 


2 


12 


TPC [89] 


untagged 


29 


26 




TASSO [178] 


untagged 


30 


2 


20 


TASSO [90] 


untagged 


34 


5 


6 


TOPAZ [157] 


untagged 


58 


5 




ALEPH [79,J,58] 


untagged 


91.2 


18 


3 


DELPHI [56] 


1 tagged 


91.2 


17 




DELPHI [56] 


b tagged 


91.2 


17 




DELPHI [56] 


untagged 


91.2 


17 




OPAL [58] 


u tagged 


91.2 


5 




OPAL [58] 


d tagged 


91.2 


5 




OPAL [58] 


s tagged 


91.2 


5 




OPAL [58] 


c tagged 


91.2 


5 




OPAL [58] 


b tagged 


91.2 


5 




OPAL [180] 


untagged 


91.2 


10 




SLD [181] 


1 tagged 


91.2 


28 




SLD [181] 


c tagged 


91.2 


28 




SLD [m] 


b tagged 


91.2 


28 




SLD [181] 


untagged 


91.2 


28 




DELPHI [182] 


untagged 


189 


3 




Total 






286 
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TABLE 6: As in Table O but for T^g. 



TABLE 5; As in Tabled but for 



Collaboration 


Tagging 


(GeV) 


# 
data 


Norm. 

(%) 


TASSO fl78] 


untagged 


12 


3 


20 


TASSO fll3] 


untagged 


14 


9 


8.5 


TASSO fll3] 


untagged 


22 


9 


6.3 


HRS [1791 


untagged 


29 


7 




TPC [89] 


untagged 


29 


20 




TASSO fl78] 


untagged 


30 


3 


20 


JADE [184] 


untagged 


34 


2 


14 


TASSO [90] 


untagged 


34 


7 


6 


TOPAZ [157] 


untagged 


58 


5 




ALEPH [79, 158] 


untagged 


91.2 


18 


3 


DELPHI [56] 


1 tagged 


91.2 


17 




DELPHI [56] 


b tagged 


91.2 


17 




DELPHI [56] 


untagged 


91.2 


17 




OPAL [58] 


u tagged 


91.2 


5 




OPAL [58] 


d tagged 


91.2 


5 




OPAL [58] 


s tagged 


91.2 


5 




OPAL [58] 


c tagged 


91.2 


5 




OPAL [58] 


b tagged 


91.2 


5 




OPAL [180] 


untagged 


91.2 


10 




SLD [181] 


1 tagged 


91.2 


29 




SLD [181] 


c tagged 


91.2 


29 




SLD [181] 


b tagged 


91.2 


29 




SLD [181] 


untagged 


91.2 


29 




DELPHI [182] 


untagged 


189 


3 




Total 






289 





Collaboration 


Tagging 


\/5 
(GeV) 


# 
data 


Norm. 

(%) 


TASSO [185] 


untagged 


14 


8 


15 


TASSO [186] 


untagged 


14.8 


8 




TASSO [186] 


untagged 


21.5 


5 




TASSO [185] 


untagged 


22 


5 


15 


HRS [179] 


untagged 


29 


12 




MARK II [183] 


untagged 


29 


17 


12 


TPC [187] 


untagged 


29 


7 




TASSO [188] 


untagged 


33.3 


7 


15 


TASSO [185] 


untagged 


34 


13 


15 


TASSO [186] 


untagged 


34.5 


13 




CELLO [189] 


untagged 


35 


9 




TASSO [186] 


untagged 


35 


13 




TASSO [186] 


untagged 


42.6 


13 




TOPAZ [157] 


untagged 


58 


4 




ALEPH [158] 


untagged 


91.2 


16 


2 


DELPHI [190] 


untagged 


91.2 


13 




OPAL [58] 


u tagged 


91.2 


5 




OPAL [58] 


d tagged 


91.2 


5 




OPAL [58] 


s tagged 


91.2 


5 




OPAL [58] 


c tagged 


91.2 


5 




OPAL [58] 


b tagged 


91.2 


5 




OPAL [191] 


untagged 


91.2 


16 


6 


SLD [53 


1 tagged 


91.2 


9 




SLD [57] 


c tagged 


91.2 


9 




SLD [57] 


b tagged 


91.2 


9 




SLD [57] 


untagged 


91.2 


9 




DELPHI [182] 


untagged 


183 


2 




DELPHI [182] 


untagged 


189 


3 




Total 






252 





TABLE 7: As in Table [S] but for A/A. 



Collaboration 


Tagging 


(GeV) 


# 
data 


Norm. 

(%) 


TASSO [185] 


untagged 


14 


3 


20 


TASSO [185] 


untagged 


22 


4 


20 


HRS [192] 


untagged 


29 


12 




MARK II [193] 


untagged 


29 


15 




TASSO [188] 


untagged 


33.3 


6 


15 


TASSO [185] 


untagged 


34 


6 


20 


TASSO [194] 


untagged 


34.8 


9 


9 


CELLO [189] 


untagged 


35 


7 




TASSO [90] 


untagged 


42.1 


4 


9 


ALEPH [158] 


untagged 


91.2 


16 


4 


DELPHI [195] 


untagged 


91.2 


7 




OPAL [58] 


u tagged 


91.2 


5 




OPAL [58] 


d tagged 


91.2 


5 




OPAL [58] 


s tagged 


91.2 


5 




OPAL [58] 


c tagged 


91.2 


5 




OPAL [58] 


b tagged 


91.2 


5 




OPAL [196] 


untagged 


91.2 


12 




SLD [57] 


1 tagged 


91.2 


4 






c tagged 


yi.z 


A 

4 




SLD [57] 


b tagged 


91.2 


4 




SLD [57] 


untagged 


91.2 


9 




DELPHI [182] 


untagged 


183 


3 




DELPHI [182] 


untagged 


189 


3 




Total 






145 
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D.2. pp{p) reactions 



TABLE 8: As in Table |3] but for pp{p) reactions. In tlie 
case of the BRAHMS data, the values in brackets are the 
normalization errors below 3 GeV. 



TABLE 9: As in Table H] but for 7^=* 



Collaboration 


Rapidity 


(GeV) 


# 
data 


Norm. 

(%) 


BRAHMS fl97] 


y G [2.9, 3] 


200 


8 


11,7,8(13), 
2,1(3) 


y G [3.25,3.35] 


7 


PHENIX fl98] (tt") 


\v\ < 0.35 


200 


13 


9.7 


STAR [199] (tt") 


r? = 3.3 


200 


4 


16 


STAR [199] (tt") 


r? = 3.8 


200 


2 


16 


STAR [116] 


\y\ < 0.5 


200 


10 


11.7 


Total 






44 





Collaboration 


Rapidity 


(GeV) 


# 
data 


Norm. 

(%) 


BRAHMS [197] 


y G [2.9, 3] 


200 


8 


11,7,8(13), 
2,1(3) 


y G [3.25,3.35] 


6 


CDF \m iK°s) 


N <i 


630 


37 


10 


STAR [116] (A-g) 


\y\ < 0.5 


200 


9 


11.7 


Total 






60 





TABLE 10: As in Table |H1 but for p{p). 



Collaboration 


Rapidity 


(GeV) 


# 
data 


Norm. 

(%) 




y G [2.9, 3] 


200 


7 


11,7,8(13), 


BRAHMS [197] 


y G [3.25,3.35] 


5 


2,1(3) 


STAR [116] 


\y\ < 0.5 


200 


8 


11.7 


Total 






20 





TABLE 11: As in Table [H but for isTg. 



TABLE 12: As in Table |8] but for A/A. 



Collaboration 


Rapidity 


(GeV) 


# 
data 


Norm. 

(%) 


BRAHMS [197] 


yG [2.9,3] 


200 


8 


11,7,8(13), 
2,1(3) 


y G [3.25, 3.35] 


6 


CDF [140] 


\v\ < 1 


630 


48 




STAR \n6] 


\y\ < 0.5 


200 


9 


11.7 


Total 






71 





Collaboration 


Rapidity 


(GeV) 


# 
data 


Norm. 

(%) 


CDF [140] 


\v\ < 1 


630 


34 


10 


STAR [117] 


\y\ < 0.5 


200 


9 


11.7 


Total 






43 
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